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NOVEL COMPOSITIONS AND METHODS FOR THE TREATMENT OF IMMUNE RELATED 

DISEASE 

Field of the Invention 

The present invention relates to compositions and methods useful for the diagnosis and alleviation 
of immune related disease. 

Background of the Invention 
Immune related and inflammatory diseases are the manifestation or consequence of fairly complex, 
often multiple interconnected biological pathways which in normal physiology are critical to respond to 
insult or injury, initiate repair from insult or injury, and mount innate and acquired defense against foreign 
organisms. Disease or pathology occurs when these normal physiological pathways cause additional insult 
or injury either as directly related to the intensity of the response, as a consequence of abnormal regulation 
or excessive stimulation, as a reaction to self, or as a combination of these. 

Though the genesis of these diseases often involves multistep pathways and often multiple different 
biological systems/pathways, intervention at critical points in one or more of these pathways can have an 
ameliorative or therapeutic effect. Therapeutic intervention can occur by either antagonism of a detrimental 
process/pathway or stimulation of a beneficial process/pathway. 

Many immune related diseases are known and have been extensively studied. Such diseases 
include immune-mediated inflammatory diseases, non-immune-mediated inflammatory diseases, infectious 
diseases, immunodeficiency diseases, neoplasia, etc. 

T lymphocytes (T cells) are an important component of a mammalian immune response. T cells 
recognize antigens which are associated with a self-molecule encoded by genes within the major 
histocompatibility complex (MHC). The antigen may be displayed together with MHC molecules on the 
surface of antigen presenting cells, virus infected cells, cancer cells, grafts, etc. The T cell system eliminates 
these altered cells which pose a health threat to the host mammal. T cells include helper T cells and 
cytotoxic T cells. Helper T cells proliferate extensively following recognition of an antigen -MHC complex 
on an antigen presenting cell. Helper T cells also secrete a variety of cytokines, U, lymphokines, which 
play a central role in the activation of B cells, cytotoxic T cells and a variety of other cells which participate 
in the immune response. 

Several diseases of the skin are correlated with an aberrant T cell response and to autoimmunity. 
Psoriasis is thought to be an autoimmune disease. Specifically, T-cells of the immune system recognize a 
protein in the skin and attack the area where that protein is found, causing the too-rapid growth of new skin 
cells and painful, elevated, scaly lesions. These lesions are characterized by hyperproliferation of 
keratinocytes and the accumulation of activated T-cells in the epidermis of the psoriatic lesions. There are 
several forms of psoriasis; guttate is the one that most commonly occurs in children and teens. It is 
sometimes preceded by an upper respiratory infection. Guttate psoriasis is noncontagious and characterized 
by small drop-like lesions, usually scattered over the trunk, limbs and scalp. According to the National 



WO 2004/081199 



PCT/US2004/007862 



Psoriasis Foundation, approximately seven million people in the United States have psoriasis. About 20,000 
children are diagnosed with psoriasis annually, and many of the cases are attributed to upper respiratory 
infections. It is estimated that only about 1.5 million people with psoriasis actually seek treatment, primarily 
due to lack of or dissatisfaction with current treatments Although the initial molecular cause of disease is 
5 unknown, genetic linkages have been mapped to at least 7 psoriasis susceptibility loci (Psorl on 6p21.3, 
Psor2 on I7q, Psor3 on 4q, Psor4 on I cent-q2l, Psor5 on 3q21, Psor6 on 19pl3, and Psor7 on lp). Some of 
these loci overlap with other autoimmune/inflammatory diseases including rheumatoid arthritis, atopic 
dermatitis, and irritable bowel disease. In this application, experiments determine that a gene is upregulated 
in psoriatic skin vs. normal skin. 
10 The term inflammatory bowel disorder ("IBD") describes a group of chronic inflammatory 

disorders of unknown causes in which the intestine (bowel) becomes inflamed, often causing recurring 
cramps or diarrhea. The prevalence of IBD in the US is estimated to be about 200 per 100,000 population. 
Patients with IBD can be divided into two major groups, those with ulcerative colitis ("UC") and those with 
Crohn's disease ("CD"). 

15 In patients with UC, there is an inflammatory reaction primarily involving the colonic mucosa. The 

inflammation is typically uniform and continuous with no intervening areas of normal mucosa. Surface 
mucosal cells as well as crypt epithelium and submucosa are involved in an inflammatory reaction with 
neutrophil infiltration. Ultimately, this situation typically progresses to epithelial damage with loss of 
epithelial cells resulting in multiple ulcerations, fibrosis, dysplasia and longitudinal retraction of the colon. 

20 CD differs from UC in that the inflammation extends through all layers of the intestinal wall and 

involves mesentery as well as lymph nodes. CD may affect any part of the alimentary canal from mouth to 
anus. The disease is often discontinuous, i.e., severely diseased segments of bowel are separated from 
apparently disease-free areas. In CD, the bowel wall also thickens which can lead to obstructions. In 
addition, fistulas and fissures are not uncommon. 

25 Clinically, IBD is characterized by diverse manifestations often resulting in a chronic, unpredictable 

course. Bloody diarrhea and abdominal pain are often accompanied by fever and weight loss. Anemia is not 
uncommon, as is severe fatigue. Joint manifestations ranging from arthralgia to acute arthritis as well as 
abnormalities in liver function are commonly associated with IBD. Patients with IBD also have an increased 
risk of colon carcinomas compared to the general population. During acute "attacks" of IBD, work and other 

30 normal activity are usually impossible, and often a patient is hospitalized. 

Although the cause of IBD remains unknown, several factors such as genetic, infectious and 
immunologic susceptibility have been implicated. IBD is much more common in Caucasians, especially 
those of Jewish descent. The chronic inflammatory nature of the condition has prompted an intense search 
for a possible infectious cause. Although agents have been found which stimulate acute inflammation, none 

35 has been found to cause the chronic inflammation associated with IBD. The hypothesis that IBD is an 
autoimmune disease is supported by the previously mentioned extraintestinal manifestation of IBD as joint 
arthritis, and the known positive response to IBD by treatment with therapeutic agents such as adrenal 
glucocorticoids, cyclosporine and azathioprine, which are known to suppress immune response. In addition, 
the GI tract, more than any other organ of the body, is continuously exposed to potential antigenic 

40 substances such as proteins from food, bacterial byproducts (LPS), etc. 
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Once the diagnosis has been made, typically by endoscopy, the goals of therapy are to induce and 
maintain a remission. The least toxic agents which patients are typically treated with are the 
aminosalicylates. Sulfasalazine (Azulfidine), typically administered four times a day, consists of an active 
molecule of aminosalicylate (5-ASA) which is linked by an azo bond to a sulfapyridine. Anaerobic bacteria 
in the colon split the azo bond to release active 5-ASA. However, at least 20% of patients cannot tolerate 
sulfapyridine because it is associated with significant side-effects such as reversible sperm abnormalities, 
dyspepsia or allergic reactions to the sulpha component. These side effects are reduced in patients taking 
olsalazine. However, neither sulfasalazine nor olsalazine are effective for the treatment of small bowel 
inflammation. Other formulations of 5-ASA have been developed which are released in the small intestine 
(e.g. mesalamine and asacol). Normally it takes 6-8 weeks for 5-ASA therapy to show full efficacy. 
Patients who do not respond to 5-ASA therapy, or who have a more severe disease, are prescribed 
corticosteroids. However, this is a short term therapy and cannot be used as a maintenance therapy. Clinical 
remission is achieved with corticosteroids within 2-4 weeks, however the side effects are significant and 
include a Cushing goldface, facial hair, severe mood swings and sleeplessness. The response to 
sulfasalazine and 5-aminosalicylate preparations is poor in Crohn's disease, fair to mild in early ulcerative 
colitis and poor in severe ulcerative colitis. If these agents fail, powerful immunosuppressive agents such as 
cyclosporins, prednisone, 6-mercaptopurine or azathioprine (converted in the liver to 6-mercaptopurine) are 
typically tried. For Crohn's disease patients, the use of corticosteroids and other immunosuppressives must 
be carefully monitored because of the high risk of intra-abdominal sepsis originating in the fistulas and 
abscesses common in this disease. Approximately 25% of IBD patients will require surgery (colectomy) 
during the course of the disease. 

Further, the risk of colon cancer is elevated (;> 32X) in patients with severe ulcerative colitis, 
particularly if the disease has existed for several years. About 20-25% of patients with IBD eventually 
require surgery for removal of the colon because of massive bleeding, chronic debilitating illness, 
performation of the colon, or risk of cancer. Surgery is also sometimes performed when other forms of 
medical treatment fail or when the side effects of steroids or other medications threaten the patient's health. 
As surgery is invasive and drastically life altering, it is not a highly desireable treatment regimen, and is 
typically the treatment of last resort. 

In addition to pharmaceutical medicine and surgery, nonconventional treatments for IBD such as 
nutritional therapy have also been attempted. For example, Rexical®, a semi-elemental formula, has been 
shown to be as effective as the steroid prednisolone. Sanderson et al t Arch. Dis. Child. 51:123-7 (1987). 
However, semi-elemental formulas are relatively expensive and are typically unpalatable - thus their use has 
been restricted. Nutritional therapy incorporating whole proteins has also been attempted to alleviate the 
symptoms of IBD. Giafer et ai, Lancet 335: 816-9 (1990). U.S.P. 5,461,033 describes the use of acidic 
casein isolated from bovine milk and TGF-2. Beattie et al, Aliment Pharmacol Ther. 8: 1-6 (1994) 
describes the use of casein in infant formula in children with IBD. U.S.P. 5,952,295 describes the use of 
casein in an enteric formulation for the treatment of IBD. However, while nutrional therapy is non-toxic, it 
is only a palliative treatment and does not treat the underlying cause of the disease. 

Despite the above identified advances in immune research, there is a great need for additional 
diagnostic and therapeutic agents capable of detecting the presence of an immune related disease in a 
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mammal and for effectively inhibiting these afflictions. Accordingly, it is an objective of the present 
invention to identify polypeptides that are overexpressed in immune related disease states as compared to 
normal tissue, and to use those polypeptides, and their encoding nucleic acids, to produce compositions of 
matter useful in the therapeutic treatment and diagnostic detection of immune related disease in mammals. 

5 

Summary of the Invention 

A. Embodiments 

The present invention concerns compositions and methods useful for the diagnosis and treatment of 
immune related disease in mammals, including humans. The present invention is based on the identification 

10 of proteins (including agonist and antagonist antibodies) which are a result of immune related disease in 
mammals. Immune related diseases such as psoriasis or IBD may be treated by suppressing the immune 
response. Molecules mat enhance the immune response stimulate or potentiate the immune response to an 
antigen. Molecules which, stimulate the immune response can be used therapeutically where enhancement of 
the immune response would be beneficial. Alternatively, molecules that suppress the immune response 

15 attenuate or reduce the immune response to an antigen (e.g., neutralizing antibodies) can be used 
therapeutically where attenuation of the immune response would be beneficial (e.g., inflammation). 
Accordingly, the PRO polypeptides, agonists and antagonists thereof are also useful to prepare medicines 
and medicaments for the treatment of immune related disease. In a specific aspect, such medicines and 
medicaments comprise a therapeutically effective amount of a PRO polypeptide, agonist or antagonist 

20 thereof with a pharmaceutical^ acceptable carrier. Preferably, the admixture is sterile. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a 
PRO polypeptide which comprises contacting the PRO polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a 
native sequence PRO polypeptide. In a specific aspect, the PRO agonist or antagonist is an anti-PRO 

25 antibody. 

In another embodiment, the invention concerns a composition of matter comprising a PRO 
polypeptide or an agonist or antagonist antibody which binds die polypeptide in admixture with a carrier or 
excipient. In one aspect, the composition comprises a therapeutically effective amount of the polypeptide or 
antibody. In a further aspect, when the composition comprises a immune related disease inhibiting 

30 molecule, the composition is useful for: (a) reducing the amount of psoriasis or affected colon tissue in a 
mammal in need thereof, (b) inhibiting or reducing an auto-immune response in a mammal in need thereof, 
In another aspect, the composition comprises a further active ingredient, which may, for example, be a 
further antibody or a cytotoxic or chemotherapeutic agent Preferably, the composition is sterile. 

In another embodiment, the invention concerns a method of treating immune related disease in a 

35 mammal in need thereof, comprising administering to the mammal an effective amount of a PRO 
polypeptide, an agonist thereof, or an antagonist thereto. 

In another embodiment, the invention provides an antibody which specifically binds to any of the 
above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized 
antibody, antibody fragment or single-chain antibody. In one aspect, the present invention concerns an 

40 isolated antibody which binds a PRO polypeptide. In another aspect, the antibody mimics the activity of a 
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PRO polypeptide (an agonist antibody) or conversely the antibody inhibits or neutralizes the activity of a 
PRO polypeptide (an antagonist antibody). In another aspect, the antibody is a monoclonal antibody, which 
preferably has nonhuman complementarity determining region (CDR) residues and human framework region 
(FR) residues. The antibody may be labeled and may be immobilized on a solid support. In a further aspect, 
the antibody is an antibody fragment, a monoclonal antibody, a single-chain antibody, or an anti-idiotypic 
antibody. 

In yet another embodiment, the present invention provides a composition comprising an anti-PRO 
antibody in admixture with a pharmaceutically acceptable carrier. In one aspect, the composition comprises 
a therapeutically effective amount of the antibody. Preferably, the composition is sterile. The composition 
may be administered in the form of a liquid pharmaceutical formulation, which may be preserved to achieve 
extended storage stability. Alternatively, the antibody is a monoclonal antibody, an antibody fragment, a 
humanized antibody, or a single-chain antibody. 

In a further embodiment, the invention concerns an article of manufacture, comprising: v 

(a) a composition of matter comprising a PRO polypeptide or agonist or antagonist thereof; 

(b) a container containing said composition; and 

(c) a label affixed to said container, or a package insert included in said container referring to 
the use of said PRO polypeptide or agonist or antagonist thereof in the treatment of an immune related 
disease. The composition may comprise a therapeutically effective amount of the PRO polypeptide or the 
agonist or antagonist thereof. 

In yet another embodiment, the present invention concerns a method of diagnosing immune related 
disease in a mammal, comprising detecting the level of expression of a gene encoding a PRO polypeptide (a) 
in a test sample of tissue cells obtained from the mammal, and (b) in a control sample of known normal 
tissue cells of the same cell type, wherein a higher or lower expression level in the test sample as compared 
to the control sample indicates the presence of immune related disease in the mammal from which the test 
tissue cells were obtained. 

In another embodiment, the present invention concerns a method of diagnosing immune related 
disease in a mammal, comprising (a) contacting an anti-PRO antibody with a test sample of tissue cells 
obtained from the mammal, and (b) detecting the formation of a complex between the antibody and a PRO 
polypeptide, in the test sample; wherein the formation of said complex is indicative of the presence or 
absence of said immune related disease. The detection may be qualitative or quantitative, and may be 
performed in comparison with monitoring the complex formation in a control sample of known normal 
tissue cells of the same cell type. A larger quantity of complexes formed in the test sample indicates the 
presence or absence of immune related disease in the mammal from which the test tissue cells were 
obtained. The antibody preferably carries a detectable label. Complex formation can be monitored, for 
example, by light microscopy, flow cytometry, fluorimetry, or other techniques known in the art. The test 
sample is usually obtained from an individual suspected of having immune related disease. 

In another embodiment, the invention provides a method for determining the presence of a PRO 
polypeptide in a sample comprising exposing a test sample of cells suspected of containing the PRO 
polypeptide to an anti-PRO antibody and determining the binding of said antibody to said cell sample. In a 
specific aspect, the sample comprises a cell suspected of containing the PRO polypeptide and the antibody 
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binds to the cell. The antibody is preferably detectably labeled and/or bound to a solid support. 

In another embodiment, the present invention concerns an immune related disease diagnostic kit, 
comprising an anti-PRO antibody and a carrier in suitable packaging. The kit preferably contains 
instructions for using the antibody to detect the presence of the PRO polypeptide. Preferably the carrier is 
5 pharmaceutical^ acceptable. 

In another embodiment, the present invention concerns a diagnostic kit, containing an anti-PRO 
antibody in suitable packaging. The kit preferably contains instructions for using the antibody to detect the 
PRO polypeptide. 

In another embodiment, the invention provides a method of diagnosing an immune related disease 
10 in a mammal which comprises detecting the presence or absence or a PRO polypeptide in a test sample of 
tissue cells obtained from said mammal, wherein the presence or absence of the PRO polypeptide in said test 
sample is indicative of the presence of immune related disease in said mammal. 

In another embodiment, the present invention concerns .a method for identifying an agonist of a 
PRO polypeptide comprising: 
15 (a) contacting cells and a test compound to be screened under conditions suitable for the induction 

of a cellular response normally induced by a PRO polypeptide; and 

(b) determining the induction of said cellular response to determine if the test compound is an 
effective agonist, wherein the induction of said cellular response is indicative of said test compound being an 
effective agonist. 

20 In another embodiment, the invention concerns a method for identifying a compound capable of 

inhibiting the activity of a PRO polypeptide comprising contacting a candidate compound with a PRO 
polypeptide under conditions and for a time sufficient to allow these two components to interact and 
determining whether the activity of the PRO polypeptide is inhibited. In a specific aspect, either the 
candidate compound or the PRO polypeptide is immobilized on a solid support. In another aspect, the non- 
25 immobilized component carries a detectable label. In a preferred aspect, this method comprises the steps of: 

(a) contacting cells and a test compound to be screened in the presence of a PRO polypeptide under 
conditions suitable for the induction of a cellular response normally induced by a PRO polypeptide; and 

(b) determining the induction of said cellular response to determine if the test compound is an 
effective antagonist. 

30 In another embodiment, the invention provides a method for identifying a compound that inhibits 

the expression of a PRO polypeptide in cells that normally express the polypeptide, wherein the method 
comprises contacting the cells with a test compound and determining whether the expression of the PRO 
polypeptide is inhibited. In a preferred aspect, this method comprises the steps of: 

(a) contacting cells and a test compound to be screened under conditions suitable for allowing 
35 expression of the PRO polypeptide; and 

(b) determining the inhibition of expression of said polypeptide. 

In yet another embodiment, the present invention concerns a method for treating immune related 
disease in a mammal that suffers therefrom comprising administering to the mammal a nucleic acid molecule 
that codes for either (a) a PRO polypeptide, (b) an agonist of a PRO polypeptide or (c) an antagonist of a 
40 PRO polypeptide, wherein said agonist or antagonist may be an anti-PRO antibody. In a preferred 
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embodiment, the mammal is human. In another preferred embodiment, the nucleic acid is administered via 
ex vivo gene therapy. In a further preferred embodiment, the nucleic acid is comprised within a vector, more 
preferably an adenoviral, adeno-associated viral, lentiviral or retroviral vector. 

In yet another aspect, the invention provides a recombinant viral particle comprising a viral vector 
5 consisting essentially of a promoter, nucleic acid encoding (a) a PRO polypeptide, (b) an agonist polypeptide 
of a PRO polypeptide, or (c) an antagonist polypeptide of a PRO polypeptide, and a signal sequence for 
cellular secretion of the polypeptide, wherein the viral vector is in association with viral structural proteins. 
Preferably, the signal sequence is from a mammal, such as from a native PRO polypeptide. 

In a still further embodiment, the invention concerns an ex vivo producer cell comprising a nucleic 
10 acid construct that expresses retroviral structural proteins and also comprises a retroviral vector consisting 
essentially of a promoter, nucleic acid encoding (a) a PRO polypeptide, (b) an agonist polypeptide of a PRO 
polypeptide or (c) an antagonist polypeptide of a PRO polypeptide, and a signal sequence for cellular 
secretion of the polypeptide, wherein said producer cell packages the retroviral vector in association with the 
structural proteins to produce recombinant retroviral particles. 
15 B. Additional Embodiments 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the herein described polypeptides. Host cell comprising any such vector are also provided. 
By way of example, the host cells may be CHO cells, E. coli, or yeast. A process for producing any of the 
herein described polypeptides is further provided and comprises culturing host cells under conditions 
20 suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell 
culture. 

In other embodiments, the invention provides chimeric molecules comprising any of the herein 
described polypeptides fused to a heterologous polypeptide or amino acid sequence. Example of such 
chimeric molecules comprise any of the herein described polypeptides fused to an epitope tag sequence or a 
25 Fc region of an immunoglobulin. 

In another embodiment, the invention provides an antibody which specifically binds to any of the 
above or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized 
antibody, antibody fragment or single-chain antibody. 

In yet other embodiments, the invention provides oligonucleotide probes useful for isolating 
30 genomic and cDNA nucleotide sequences or as antisense probes, wherein those probes may be derived from 
any of the above or below described nucleotide sequences. 

In other embodiments, the invention provides an isolated nucleic acid molecule comprising a 
nucleotide sequence that encodes a PRO polypeptide. 

In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
35 about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, 
alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid 
sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 
85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, 
alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid 
40 sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 
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90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, 
alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid 
sequence identity, alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 
95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid sequence identity, 
5 alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid 
sequence identity and alternatively at least about 99% nucleic acid sequence identity to (a) a DNA molecule 
encoding a PRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid 
sequence lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, 
with or without the signal peptide, as disclosed herein or any other specifically defined fragment of the full- 

10 length amino acid sequence as disclosed herein, or (b) die complement of the DNA molecule of (a). 

In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least 
about 80% nucleic acid sequence identity, alternatively at least about 81% nucleic acid sequence identity, 
alternatively at least about 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid 
sequence identity, alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 

15 85% nucleic acid sequence identity, alternatively at least about 86% nucleic acid sequence identity, 
alternatively at least about 87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid 
sequence identity, alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 
90% nucleic acid sequence identity, alternatively at least about 91% nucleic acid sequence identity, 
alternatively at least about 92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid 

20 sequence identity, alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 
95% nucleic acid sequence identity, alternatively at least about 96% nucleic acid sequence identity, 
alternatively at least about 97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid 
sequence identity and alternatively at least about 99% nucleic acid sequence identity to (a) a DNA molecule 
comprising the coding sequence of a full-length PRO polypeptide cDNA as disclosed herein, the coding 

25 sequence of a PRO polypeptide lacking the signal peptide as disclosed herein, the coding sequence of an 
extracellular domain of a transmembrane PRO polypeptide, with or without the signal peptide, as disclosed 
herein or the coding sequence of any other specifically defined fragment of the full-length amino acid 
sequence as disclosed herein, or (b) the complement of the DNA molecule of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a 

30 nucleotide sequence having at least about 80% nucleic acid sequence identity, alternatively at least about 
81% nucleic acid sequence identity, alternatively at least about 82% nucleic acid sequence identity, 
alternatively at least about 83% nucleic acid sequence identity, alternatively at least about 84% nucleic acid 
sequence identity, alternatively at least about 85% nucleic acid sequence identity, alternatively at least about 
86% nucleic acid, sequence identity, alternatively at least about 87% nucleic acid sequence identity, 

35 alternatively at least about 88% nucleic acid sequence identity, alternatively at least about 89% nucleic acid 
sequence identity, alternatively at least about 90% nucleic acid sequence identity, alternatively at least about 
91% nucleic acid sequence identity, alternatively at least about 92% nucleic acid sequence identity, 
alternatively at least about 93% nucleic acid sequence identity, alternatively at least about 94% nucleic acid 
sequence identity, alternatively at least about 95% nucleic acid sequence identity, alternatively at least about 

40 96% nucleic acid sequence identity, alternatively at least about 97% nucleic acid sequence identity, 
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alternatively at least about 98% nucleic acid sequence identity and alternatively at least about 99% nucleic 
acid sequence identity to (a) a DNA molecule that encodes the same mature polypeptide encoded by any of 
the human protein cDNAs as disclosed herein, or (b) the complement of the DNA molecule of (a). 

Another aspect the invention provides an isolated nucleic acid molecule comprising a nucleotide 

5 sequence encoding a PRO polypeptide which is either transmembrane domain-deleted or transmembrane 
domain-inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane 
domain(s) of such polypeptide are disclosed herein. Therefore, soluble extracellular domains of the herein 
described PRO polypeptides are contemplated. 

Another embodiment is directed to fragments of a PRO polypeptide coding sequence, or the 

10 complement thereof, that may find use as, for example, hybridization probes, for encoding fragments of a 
PRO polypeptide that may optionally encode a polypeptide comprising a binding site for an anti-PRO 
antibody or as antisense oligonucleotide probes. Such nucleic acid fragments are usually at least about 20 
nucleotides in length, alternatively at least about 30 nucleotides in length, alternatively at least about 40 
nucleotides in length, alternatively at least about 50 nucleotides in length, alternatively at least about 60 

15 nucleotides in length, alternatively at least about 70 nucleotides in length, alternatively at least about 80 
nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least about 100 
nucleotides in length, alternatively at least about 110 nucleotides in length, alternatively at least about 120 
nucleotides in length, alternatively at least about 130 nucleotides in length, alternatively at least about 140 
nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least about 160 

20 nucleotides in length, alternatively at least about 170 nucleotides in length, alternatively at least about 180 
nucleotides in length, alternatively at least about 190 nucleotides in length, alternatively at least about 200 
nucleotides in length, alternatively at least about 250 nucleotides in length, alternatively at least about 300 
nucleotides in length, alternatively at least about 350 nucleotides in length, alternatively at least about 400 
nucleotides in length, alternatively at least about 450 nucleotides in length, alternatively at least about 500 

25 nucleotides in length, alternatively at least about 600 nucleotides in length, alternatively at least about 700 
nucleotides in length, alternatively at least about 800 nucleotides in length, alternatively at least about 900 
nucleotides in length and alternatively at least about 1000 nucleotides in length, wherein in this context the 
term "about" means the referenced nucleotide sequence length plus or minus 10% of that referenced length. 
It is noted that novel fragments of a PRO polypeptide-encoding nucleotide sequence may be determined in a 

30 routine manner by aligning the PRO polypeptide-encoding nucleotide sequence with other known nucleotide 
sequences using any of a number of well known sequence alignment programs and determining which PRO 
polypeptide-encoding nucleotide sequence fragment(s) are novel. All of such PRO polypeptide-encoding 
nucleotide sequences are contemplated herein. Also contemplated are the PRO polypeptide fragments 
encoded by these nucleotide molecule fragments, preferably those PRO polypeptide fragments that comprise 

35 a binding site for an anti-PRO antibody. 

In another embodiment, the invention provides isolated PRO polypeptide encoded by any of the 
isolated nucleic acid sequences herein above identified. 

In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid 
sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino 

40 acid sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least 
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about 83% amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, 
alternatively at least about 85% amino acid sequence identity, alternatively at least about 86% amino acid 
sequence identity, alternatively at least about 87% amino acid sequence identity, alternatively at least about 
88% amino acid sequence identity, alternatively at least about 89% amino acid sequence identity, 
5 alternatively at least about 90% amino acid sequence identity, alternatively at least about 91% amino acid 
sequence identity, alternatively at least about 92% amino acid sequence identity, alternatively at least about 
93% amino acid sequence identity, alternatively at least about 94% amino acid sequence identity, 
alternatively at least about 95% amino acid sequence identity, alternatively at least about 96% amino acid 
sequence identity, alternatively at least about 97% amino acid sequence identity, alternatively at least about 

10 98% amino acid sequence identity and alternatively at least about 99% amino acid sequence identity to a 
PRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence 
lacking the signal peptide as disclosed herein, an extracellular domain of a transmembrane protein, with or 
without the signal peptide, as disclosed herein or any other specifically defined fragment of the full-length 
amino acid sequence as disclosed herein. 

15 In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 

sequence having at least about 80% amino acid sequence identity, alternatively at least about 81% amino 
acid sequence identity, alternatively at least about 82% amino acid sequence identity, alternatively at least 
about 83% amino acid sequence identity, alternatively at least about 84% amino acid sequence identity, 
alternatively at least about 85% amino acid sequence identity, alternatively at least about 86% amino acid 

20 sequence identity, alternatively at least about 87% amino acid sequence identity, alternatively at least about 
88% amino acid sequence identity, alternatively at least about 89% amino acid sequence identity, 
alternatively at least about 90% amino acid sequence identity, alternatively at least about 91% amino acid 
sequence identity, alternatively at least about 92% amino acid sequence identity, alternatively at least about 
93% amino acid sequence identity, alternatively at least about 94% amino acid sequence identity, 

25 alternatively at least about 95% amino acid sequence identity, alternatively at least about 96% amino acid 
sequence identity, alternatively at least about 97% amino acid sequence identity, alternatively at least about 
98% amino acid sequence identity and alternatively at least about 99% amino acid sequence identity to an 
amino acid sequence encoded by any of the human protein cDNAs as disclosed herein. 

In a specific aspect, the invention provides an isolated PRO polypeptide without the N-terminal 

30 signal sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such 
an amino acid sequence as herein before described. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide 
and recovering the PRO polypeptide from the cell culture. 

35 Another aspect the invention provides an isolated PRO polypeptide which is either transmembrane 

domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide 
and recovering the PRO polypeptide from the cell culture. 
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In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 
polypeptide as defined herein. In a particular embodiment, the agonist or antagonist is an anti-PRO antibody 
or a small molecule. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a 
5 PRO polypeptide which comprise contacting the PRO polypeptide with a candidate molecule and monitoring 
a biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native PRO 
polypeptide. 

In a still further embodiment, the invention concerns a composition of matter comprising a PRO 
polypeptide, or an agonist or antagonist of a PRO polypeptide as herein described, or an anti-PRO antibody, 
10 in combination with a carrier. Optionally, the carrier is a pharmaceutical^ acceptable carrier. 

Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an 
agonist or antagonist thereof as herein before described, or an anti-PRO antibody, for the preparation of a 
medicament useful iri the treatment of a condition which is responsive to the PRO polypeptide, an agonist or 
antagonist thereof or an anti-PRO antibody. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a nucleotide sequence (SEQ ID NO:l) of a native sequence PR062782 cDNA, 
wherein SEQ ID NO: 1 is a clone designated herein as "DNA275062". 

Figure 2 shows the amino acid sequence (SEQ ID NO:2) derived from the coding sequence of SEQ 

20 ID NO: 1 shown in Figure 1 . 

Figure 3 shows a nucleotide sequence (SEQ ID NO:3) of a native sequence PR036822 cDNA, 
wherein SEQ ID NO:3 is a clone designated herein as "DNA226359". 

Figure 4 shows the amino acid sequence (SEQ ID NO:4) derived from the coding sequence of SEQ 
ID NO:3 shown in Figure 3. 

25 Figure 5 shows a nucleotide sequence (SEQ ID NO:5) of a native sequence PR037226 cDNA, 

wherein SEQ ID NO:5 is a clone designated herein as "DNA226763". 

Figure 6 shows the amino acid sequence (SEQ ID NO:6) derived from the coding sequence of SEQ 
ID NO: 5 shown in Figure 5. 

Figure 7 shows a nucleotide sequence (SEQ ID NO:7) of a native sequence PR071142 cDNA, 
30 wherein SEQ ID NO:7 is a clone designated herein as "DNA304716". 

Figure 8 shows the amino acid sequence (SEQ ID NO:8) derived from the coding sequence of SEQ 
ID NO:7 shown in Figure 7. 

Figure 9 shows a nucleotide sequence (SEQ ID NO:9) of a native sequence PR038477 cDNA, 
wherein SEQ ID NO:9 is a clone designated herein as "DNA2280I4". 
35 Figure 10 shows the amino acid sequence (SEQ ID NO: 10) derived from the coding sequence of 

SEQ ID NO:9 shown in Figure 9. 

Figure 1 1 shows a nucleotide sequence (SEQ ID NO:l 1) of a native sequence PRO 10275 cDNA, 
wherein SEQ ID NO:l 1 is a clone designated herein as "DNA143498". 

Figure 12 shows the amino acid sequence (SEQ ID NO: 12) derived from the coding sequence of 
40 SEQ ID NO : 1 1 shown in Figure 1 1 . 
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Figure 13A-B shows a nucleotide sequence (SEQ ID NO: 13) of a native sequence PRO81407 
cDNA, wherein SEQ ID NO: 13 is a clone designated herein as "DNA324792". 

Figure 14 shows the amino acid sequence (SEQ ID NO: 14) derived from the coding sequence of 
SEQ ID NO: 13 shown in Figure 13A-B. 
5 Figure 15 shows a nucleotide sequence (SEQ ID NO: 15) of a native sequence PR069584 cDNA, 

wherein SEQ ID NO: 15 is a clone designated herein as "DNA287319". 

Figure 16 shows the amino acid sequence (SEQ ID NO: 16) derived from the coding sequence of 
SEQ ID NO: 15 shown in Figure 15. 

Figure 17 shows a nucleotide sequence (SEQ ID NO: 17) of a native sequence PR088263 cDNA, 
10 wherein SEQ ID NO: 17 is a clone designated herein as "DNA333620". 

Figure 18 shows the amino acid sequence (SEQ ID NO: 18) derived from the coding sequence of 
SEQ ID NO: J 7 shown in Figure 17. 
• Figure 19 shows a nucleotide sequence (SEQ ID NO: 19) of a native sequence PRO80958 cDNA, 

wherein SEQ ID NO:19 is a clone designated herein as "DNA324275". 
15 Figure 20 shows the amino acid sequence (SEQ ED NO: 20) derived from the coding sequence of 

SEQ ID NO: 19 shown in Figure 19. 

Figure 21 shows a nucleotide sequence (SEQ ID NO:21) of a native sequence PR038415 cDNA, 
wherein SEQ ID NO:21 is a clone designated herein as "DNA227952". 

Figure 22 shows the amino acid sequence (SEQ ID NO:22) derived from the coding sequence of 
20 SEQ ID NO:21 shown in Figure 21. 

Figure 23 shows a nucleotide sequence (SEQ ID NO:23) of a native sequence PR04834 cDNA, 
wherein SEQ ID NO:23 is a clone designated herein as "DNA103507". 

Figure 24 shows the amino acid sequence (SEQ ID NO:24) derived from the coding sequence of 
SEQ ID NO:23 shown in Figure 23. 
25 Figure 25 shows a nucleotide sequence (SEQ ID NO:25) of a native sequence PR036719 cDNA, 

wherein SEQ ID NO:25 is a clone designated herein as "DNA226256". 

Figure 26 shows the amino acid sequence (SEQ ID NO:26) derived from the coding sequence of 
SEQ ID NO:25 shown in Figure 25. 

Figure 27 shows a nucleotide sequence (SEQ ID NO:27) of a native sequence PROS 14 14 cDNA, 
30 wherein SEQ ID NO:27 is a clone designated herein as "DNA324799". 

Figure 28 shows the amino acid sequence (SEQ ED NO: 28) derived from the coding sequence of 
SEQ ID NO:27 shown in Figure 27. 

Figure 29 shows a nucleotide sequence (SEQ ID NO:29) of a native sequence PRO50891 cDNA, 
wherein SEQ ID NO:29 is a clone designated herein as "DNA255836". 
35 Figure 30 shows the amino acid sequence (SEQ ID NO:30) derived from the coding sequence of 

SEQ ID NO:29 shown in Figure 29. 

Figure 31A-B shows a nucleotide sequence (SEQ ID NO:31) of a native sequence PRO91480 
cDNA, wherein SEQ ID NO:3 1 is a clone designated herein as "DNA339972". 

Figure 32 shows the amino acid sequence (SEQ ID NO:32) derived from the coding sequence of 
40 SEQ ID NO:31 shown in Figure 3 1 A-B. 
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Figure 33 shows a nucleotide sequence (SEQ ID NO:33) of a native sequence PRO80648 cDNA, 
wherein SEQ ID NO:33 is a clone designated herein as "DNA323910". 

Figure 34 shows the amino acid sequence (SEQ ID NO:34) derived from the coding sequence of 
SEQ ID NO:33 shown in Figure 33. 
5 Figure 35 shows a nucleotide sequence (SEQ ID NO:35) of a native sequence PR036735 cDNA, 

wherein SEQ ID NO:35 is a clone designated herein as "DNA226272". 

Figure 36 shows the amino acid sequence (SEQ ID NO:36) derived from the coding sequence of 
SEQ ID NO: 35 shown in Figure 35. 

Figure 37 shows a nucleotide sequence (SEQ ID NO:37) of a native sequence PRO 12050 cDNA, 
10 wherein SEQ ID NO:37 is a clone designated herein as "DNA151772". 

Figure 38 shows the amino acid sequence (SEQ ID NO:38) derived from the coding sequence of 
SEQ ID NO:37 shown in Figure 37. 

Figure 39A-B shows a nucleotide sequence (SEQ ID NO:39) of a native sequence PRO83903 
cDNA, wherein SEQ ID NO:39 is a clone designated herein as "DNA327983". 
15 Figure 40 shows the amino acid sequence (SEQ ID NO:40) derived from the coding sequence of 

SEQ ID NO:39 shown in Figure 39A-B. 

Figure 41A-C shows a nucleotide sequence (SEQ ID NO:41) of a native sequence PRO37509 
cDNA, wherein SEQ ID NO:41 is a clone designated herein as "DNA227046". 

Figure 42A-B shows the amino acid sequence (SEQ ID NO:42) derived from the coding sequence 
20 of SEQ ID NO:41 shown in Figure 4A-C 

Figure 43 shows a nucleotide sequence (SEQ ID NO:43) of a native sequence PRO37760 cDNA, 
wherein SEQ ID NO:43 is a clone designated herein as "DNA227297". 

Figure 44 shows the amino acid sequence (SEQ ID NO:44) derived from the coding sequence of 
SEQ ID NO:43 shown in Figure 43. 
25 Figure 45 shows a nucleotide sequence (SEQ ID NO:45) of a native sequence PR092173 cDNA, 

wherein SEQ ID N0:45 is a clone designated herein as "DNA340442". 

Figure 46 shows the amino acid sequence (SEQ ID NO:46) derived from the coding sequence of 
SEQ ID NO:45 shown in Figure 45. 

Figure 47 shows a nucleotide sequence (SEQ ID NO:47) of a native sequence PRO10849 cDNA, 
30 wherein SEQ ID NO:47 is a clone designated herein as "DNA324641". 

Figure 48 shows the amino acid sequence (SEQ ID NO:48) derived from the coding sequence of 
SEQ ID NO:47 shown in Figure 47. 

Figure 49 shows a nucleotide sequence (SEQ ID NO:49) of a native sequence PR083649 cDNA, 
wherein SEQ ID NO:49 is a clone designated herein as "DNA327651". 
35 Figure 50 shows the amino acid sequence (SEQ ID NO:50) derived from the coding sequence of 

SEQ ID NO:49 shown in Figure 49. 

Figure 51 shows a nucleotide sequence (SEQ ID NO:51) of a native sequence PR086795 cDNA, 
wherein SEQ ID N0:5 1 is a clone designated herein as "DNA331909". 

Figure 52 shows the amino acid sequence (SEQ ID N0:52) derived from the coding sequence of 
40 SEQ ID NO:5 1 shown in Figure 51 . 

13 
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Figure 53 shows a nucleotide sequence (SEQ ID NO:53) of a native sequence PR036574 cDNA, 
wherein SEQ ID NO:53 is a clone designated herein as "DNA2261 11". 

Figure 54 shows the amino acid sequence (SEQ ID NO:54) derived from the coding sequence of 
SEQ ID NO:53 shown in Figure 53. 
5 Figure 55 shows a nucleotide sequence (SEQ ID NO:55) of a native sequence PR069541 cDNA, 

wherein SEQ ID NO:55 is a clone designated herein as "DNA287270". 

Figure 56 shows the amino acid sequence (SEQ ID NO:56) derived from the coding sequence of 
SEQ ID NO:55 shown in Figure 55. 

Figure 57 shows a nucleotide sequence (SEQ ID NO:57) of a native sequence PR084948 cDNA, 
10 wherein SEQ ID NO:57 is a clone designated herein as "DNA329369". 

Figure 58 shows the amino acid sequence (SEQ ID NO:58) derived from the coding sequence of 
SEQ ID NO:57 shown in Figure 57. 

Figure 59A-B si^iows a nucleotide sequence (SEQ ID NO:59) of a native sequence PR085817 
cDNA, wherein SEQ ID NO:59 is a clone designated herein as "DNA330645". 
15 Figure 60 shows the amino acid sequence (SEQ ID NO:60) derived from the coding sequence of 

SEQ ID NO:59 shown in Figure 59A-B. 

Figure 61A-B shows a nucleotide sequence (SEQ ID NO:61) of a native sequence PR038371 
cDNA, wherein SEQ ID NO:61 is a clone designated herein as "DNA227908". 

Figure 62A-B shows the amino acid sequence (SEQ ID NO:62) derived from the coding sequence 
20 of SEQ ID NO:61 shown in Figure 61 A-B. 

Figure 63 shows a nucleotide sequence (SEQ ID NO:63) of a native sequence PRO90951 cDNA, 
wherein SEQ ID NO:63 is a clone designated herein as "DNA340375". 

Figure 64 shows the amino acid sequence (SEQ ID NO:64) derived from the coding sequence of 
SEQ ID NO:63 shown in Figure 63. 

25 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
I. Definitions 

The terms "PRO polypeptide" and "PRO" as used herein and when immediately followed by a 
numerical designation refer to various polypeptides, wherein the complete designation (i.e., PRO/number) 

30 refers to specific polypeptide sequences as described herein. The terms "PRO/number polypeptide" and 
"PRO/number" wherein the term "number" is provided as an actual numerical designation as used herein 
encompass native sequence polypeptides and polypeptide variants (which are further defined herein). The 
PRO polypeptides described herein may be isolated from a variety of sources, such as from human tissue 
types or from another source, or prepared by recombinant or synthetic methods. The term "PRO 

35 polypeptide" refers to each individual PRO/number polypeptide disclosed herein. AH disclosures in this 
specification which refer to the "PRO polypeptide" refer to each of the polypeptides individually as well as 
jointly. For example, descriptions of the preparation of, purification of, derivation of, formation of 
antibodies to or against, administration of, compositions containing, treatment of a disease with, etc., pertain 
to each polypeptide of the invention individually. The term "PRO polypeptide" also includes variants of the 

40 PRO/number polypeptides disclosed herein. 
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A "native sequence PRO polypeptide" comprises a polypeptide having the same amino acid 
sequence as the corresponding PRO polypeptide derived from nature. Such native sequence PRO 
polypeptides can be isolated from nature or can be produced by recombinant or synthetic means. The term 
"native sequence PRO polypeptide" specifically encompasses naturally-occurring truncated or secreted 
forms of the specific PRO polypeptide (e.g., an extracellular domain sequence), naturally-occurring variant 
forms (e.g., alternatively spliced forms) and naturally-occurring allelic variants of the polypeptide. In 
various embodiments of the invention, the native sequence PRO polypeptides disclosed herein are mature or 
full-length native sequence polypeptides comprising the full-length amino acids sequences shown in the 
accompanying figures. Start and stop codons are shown in bold font and underlined in the figures. 
However, while the PRO polypeptide disclosed in the accompanying figures are shown to begin with 
methionine residues designated herein as amino acid position 1 in the figures, it is conceivable and possible 
that other methionine residues located either upstream or downstream from the amino acid position 1 in the 
figures may be employed as the starting amino acid residue for the PRO polypeptides. \ 

The PRO polypeptide "extracellular domain" or "ECD" refers to a form of the PRO polypeptide 
which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide 
ECD will have less than 1% of such transmembrane and/or cytoplasmic domains and preferably, will have 
less than 0.5% of such domains. It will be understood that any transmembrane domains identified for the 
PRO polypeptides of the present invention are identified pursuant to criteria routinely employed in the art for 
identifying that type of hydrophobic domain. The exact boundaries of a transmembrane domain may vary 
but most likely by no more than about 5 amino acids at either end of the domain as initially identified herein. 
Optionally, therefore, an extracellular domain of a PRO polypeptide may contain from about 5 or fewer 
amino acids on either side of the transmembrane domain/extracellular domain boundary as identified in the 
Examples or specification and such polypeptides, with or without the associated signal peptide, and nucleic 
acid encoding them, are contemplated by the present invention. 

The approximate location of the "signal peptides" of the various PRO polypeptides disclosed herein 
are shown in the present specification and/or the accompanying figures. It is noted, however, that the C- 
terminal boundary of a signal peptide may vary, but most likely by no more than about 5 amino acids on 
either side of the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal 
boundary of the signal peptide may be identified pursuant to criteria routinely employed in the art for 
identifying that type of amino acid sequence element (e.g., Nielsen et al., Prot. Eng. 10:1-6 (1997) and von 
Heinje et al, Nucl. Acids. Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, 
cleavage of a signal sequence from a secreted polypeptide is not entirely uniform, resulting in more than one 
secreted species. These mature polypeptides, where the signal peptide is cleaved within no more than about 
5 amino acids on either side of the C-terminal boundary of the signal peptide as identified herein, and the 
polynucleotides encoding them, are contemplated by the present invention. 

"PRO polypeptide variant" means an active PRO polypeptide as defined above or below having at 
least about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence 
as disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an 
extracellular domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any 
other fragment of a full-length PRO polypeptide sequence as disclosed herein. Such PRO polypeptide 
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variants include, for instance, PRO polypeptides wherein one or more amino acid residues are added, or 
deleted, at the N- or C-terminus of the full-length native amino acid sequence. Ordinarily, a PRO 
polypeptide variant will have at least about 80% amino acid sequence identity, alternatively at least about 
81% amino acid sequence identity, alternatively at least about 82% amino acid sequence identity, 
5 alternatively at least about 83% amino acid sequence identity, alternatively at least about 84% amino acid 
sequence identity, alternatively at least about 85% amino acid sequence identity, alternatively at least about 
86% amino acid .sequence identity, alternatively at least about 87% amino acid sequence identity, 
alternatively at least about 88% amino acid sequence identity, alternatively at least about 89% amino acid 
sequence identity, alternatively at least about 90% amino acid sequence identity, alternatively at least about 
10 91% amino acid sequence identity, alternatively at least about 92% amino acid sequence identity, 
alternatively at least about 93% amino acid sequence identity, alternatively at least about 94% amino acid 
sequence identity, alternatively at least about 95% amino acid sequence identity, alternatively at least about 
96% amino acid sequence identity, alternatively at least a\bout 97% amino acid sequence identity, 
alternatively at least about 98% amino acid sequence identity and alternatively at least about 99% amino 
15 acid sequence identity to a full-length native sequence PRO polypeptide sequence as disclosed herein, a 
PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO 
polypeptide, with or without the signal peptide, as disclosed herein or any other specifically defined 
fragment of a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, PRO variant 
polypeptides are at least about 10 amino acids in length, alternatively at least about 20 amino acids in length, 
20 alternatively at least about 30 amino acids in length, alternatively at least about 40 amino acids in length, 
alternatively at least about 50 amino acids in length, alternatively at least about 60 amino acids in length, 
alternatively at least about 70 amino acids in length, alternatively at least about 80 amino acids in length, 
alternatively at least about 90 amino acids in length, alternatively at least about 100 amino acids in length, 
alternatively at least about 150 amino acids in length, alternatively at least about 200 amino acids in length, 
25 alternatively at least about 300 amino acids in length, or more. 

"Percent (%) amino acid sequence identity" with respect to the PRO polypeptide sequences 
identified herein is defined as the percentage of amino acid residues in a candidate sequence that are 
identical with the- amino acid residues in the specific PRO polypeptide sequence, after aligning the 
sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not 
30 considering any conservative substitutions as part of the sequence identity. Alignment for purposes of 
determining percent amino acid sequence identity can be achieved in various ways that are within the skill in 
the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or 
Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for 
measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of 
35 the sequences being compared. For purposes herein, however, % amino acid sequence identity values are 
generated using the sequence comparison computer program ALIGN-2, wherein the complete source code 
for the ALIGN-2 program is provided in Table 1 below. The ALIGN-2 sequence comparison computer 
program was authored by Genentech, Inc. and the source code shown in Table I below has been filed with 
user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. 
40 Copyright Registration No. TXU510087. The ALIGN-2 program is publicly available through Genentech, 
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Inc., South San Francisco, California or may be compiled from the source code provided in Table 1 below. 
The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX 
V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid 
sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B 
(which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % 
amino acid sequence identity to, with, or against a given amino acid sequence B) is calculated as follows: 

100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment 
program ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid 
residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the 
length of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. As examples of % amino acid sequence identity calculations using this 
method, Tables 2 and 3 demonstrate how to calculate the % amino acid sequence identity of the amino acid 
sequence designated "Comparison Protein" to the amino acid sequence designated "PRO", wherein "PRO" 
represents the amino acid sequence of a hypothetical PRO polypeptide of interest, "Comparison Protein" 
represents the amino acid sequence of a polypeptide against which the "PRO" polypeptide of interest is 
being compared, and "X, T and "Z" each represent different hypothetical amino acid residues. 

Unless specifically stated otherwise, all % amino acid sequence identity values used herein are 
obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program. 
However, % amino acid sequence identity values may also be obtained as described below by using the WU- 
BLAST-2 computer program (Altschul et al., Methods in Enzvmolo^v 266:460-480 (1996)). Most of the 
WU-BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the 
adjustable parameters, are set with the following values: overlap span = 1, overlap fraction = 0.125, word 
threshold (T) = 11, and scoring matrix = BLOSUM62. When WU-BLAST-2 is employed, a % amino acid 
sequence identity value is determined by dividing (a) the number of matching identical amino acid residues 
between the amino acid sequence of the PRO polypeptide of interest having a sequence derived from the 
native PRO polypeptide and the comparison amino acid sequence of interest (i.e., the sequence against 
which the PRO polypeptide of interest is being compared which may be a PRO variant polypeptide) as 
determined by WU-BLAST-2 by (b) the total number of amino acid residues of the PRO polypeptide of 
interest. For example, in the statement "a polypeptide comprising an the amino acid sequence A which has 
or having at least 80% amino acid sequence identity to the amino acid sequence B", the amino acid sequence 
A is the comparison amino acid sequence of interest and the amino acid sequence B is the amino acid 
sequence of the PRO polypeptide of interest. 

Percent amino acid sequence identity may also be determined using the sequence comparison 
program NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 
sequence comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained 
from the National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, 
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wherein all of those search parameters are set to default values including, for example, unmask = yes, strand 
= all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value = 0.01, 
constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62. 

In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % 
5 amino acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid 
sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid sequence B) is calculated as 
follows: 

10 100 times the fraction X/Y 

where X is the number of amino acid residues scored as identical matches by the sequence alignment 
program NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino \ 
acid residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the 

15 lengdi of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. 

"PRO variant polynucleotide" or 'TRO variant nucleic acid sequence" means a nucleic acid 
molecule which encodes an active PRO polypeptide as defined below and which has at least about S0% 
nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length native sequence PR.O 

20 polypeptide sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking 
the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the 
signal peptide, as disclosed herein or any other fragment of a full-length PRO polypeptide sequence as 
disclosed herein. Ordinarily, a PRO variant polynucleotide will have at least about 80% nucleic acid 
sequence identity, alternatively at least about 81% nucleic acid sequence identity, alternatively at least about 

25 82% nucleic acid sequence identity, alternatively at least about 83% nucleic acid sequence identity, 
alternatively at least about 84% nucleic acid sequence identity, alternatively at least about 85% nucleic acid 
sequence identity, alternatively at least about 86% nucleic acid sequence identity, alternatively at least about 
87% nucleic acid sequence identity, alternatively at least about 88% nucleic acid sequence identity, 
alternatively at least about 89% nucleic acid sequence identity, alternatively at least about 90% nucleic acid 

30 sequence identity, alternatively at least about 91% nucleic acid sequence identity, alternatively at least about 
92% nucleic acid sequence identity, alternatively at least about 93% nucleic acid sequence identity, 
alternatively at least about 94% nucleic acid sequence identity, alternatively at least about 95% nucleic acid 
sequence identity, alternatively at least about 96% nucleic acid sequence identity, alternatively at least about 
97% nucleic acid sequence identity, alternatively at least about 98% nucleic acid sequence identity and 

35 alternatively at least about 99% nucleic acid sequence identity with a nucleic acid sequence encoding a full- 
length native sequence PRO polypeptide sequence as disclosed herein, a full-length native sequence PRO 
polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO 
polypeptide, with or without the signal sequence, as disclosed herein or any other fragment of a full-length 
PRO polypeptide sequence as disclosed herein. Variants do not encompass the native nucleotide sequence. 
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Ordinarily, PRO variant polynucleotides are at least about 30 nucleotides in length, alternatively at 
least about 60 nucleotides in length, alternatively at least about 90 nucleotides in length, alternatively at least 
about 120 nucleotides in length, alternatively at least about 150 nucleotides in length, alternatively at least 
about 180 nucleotides in length, alternatively at least about 210 nucleotides in length, alternatively at least 
5 about 240 nucleotides in length, alternatively at least about 270 nucleotides in length, alternatively at least 
about 300 nucleotides in length, alternatively at least about 450 nucleotides in length, alternatively at least 
about 600 nucleotides in length, alternatively at least about 900 nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences 
identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with 

10 the nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing 
gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of 
determining percent nucleic acid sequence identity can be achieved in various ways that are within the skill 
in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or 
Megalign (DNASTAR) software. For purposes herein, however, % nucleic acid sequence identity values are 

15 generated using the sequence comparison computer program ALIGN-2, wherein the complete source code 
for the ALIGN-2 program is provided in Table 1 below. The ALIGN-2 sequence comparison computer 
program was authored by Genentech, Inc. and the source code shown in Table 1 below has been filed with 
user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. 
Copyright Registration No. TXU5 10087. The ALIGN-2 program is publicly available through Genentech, 

20 Inc., South San Francisco, California or may be compiled from the source code provided in Table 1 below. 
The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX 
V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary. 

In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic 
acid sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence 

25 D (which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % 
nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

100 times the fraction W/Z 

30 where W is the number of nucleotides scored as identical matches by the sequence alignment program 
ALIGN-2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It 
will be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid 
sequence D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence 
identity of D to C. As examples of % nucleic acid sequence identity calculations, Tables 4 and 5, 

35 demonstrate how to calculate the % nucleic acid sequence identity of the nucleic acid sequence designated 
"Comparison DNA" to the nucleic acid sequence designated "PRO-DNA", wherein "PRO-DNA" represents 
a hypothetical PRO-encoding nucleic acid sequence of interest, "Comparison DNA" represents the 
nucleotide sequence of a nucleic acid molecule against which the "PRO-DNA" nucleic acid molecule of 
interest is being compared, and "N", "L" and "V" each represent different hypothetical nucleotides. 
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Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are 
obtained as described in the immediately preceding paragraph using the ALIGN-2 computer program. 
However, % nucleic acid sequence identity values may also be obtained as described below by using the 
■ WU-BLAST-2 computer program (Altschul et al., Methods in Enzvmology 266:460-480 (1996)). Most of 

5 the WU-BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the 
adjustable parameters, are set with the following values: overlap span = I, overlap fraction = 0.125, word 
threshold (T) = 11, and scoring matrix = BLOSUM62. When WU-BLAST-2 is employed, a % nucleic acid 
sequence identity value is determined by dividing (a) the number of matching identical nucleotides between 
the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid molecule of interest having a 

10 sequence derived from the native sequence PRO polypeptide-encoding nucleic acid and the comparison 
nucleic acid molecule of interest (i.e., the sequence against which the PRO polypeptide-encoding nucleic 
acid molecule of interest is being compared which may be a variant PRO polynucleotide) as determined by 
WU-BLAST-2 by (b) the total number of nucleotides of the PRO polypeptide-encoding nucleic acid 
molecule of interest. For example, in the statement "an isolated nucleic acid molecule comprising a nucleic 

15 acid sequence A which has or having at least 80% nucleic acid sequence identity to the nucleic acid 
sequence B", the nucleic acid sequence A is the comparison nucleic acid molecule of interest and the nucleic 
acid sequence B is the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid molecule of 
interest. 

Percent nucleic acid sequence identity may also be determined using the sequence comparison 
20 program NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 
sequence comparison program may be downloaded from http://www.ncbi.nlm.nih.gov or otherwise obtained 
from the National Institute of Health, Bethesda, MD. NCBI-BLAST2 uses several search parameters, 
wherein all of those search parameters are set to default values including, for example, unmask = yes, strand 
= all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value = 0.01, 
25 constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62. 

In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid 
sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D 
(which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % 
nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows: 

30 

100 times the fraction W/Z 

where W is the number of nucleotides scored as identical matches by the sequence alignment program 
NCBI-BLAST2 in that program's alignment of C and D, and where Z is the total number of nucleotides in 
35 D. It will be appreciated that where the length of nucleic acid sequence C is not equal to the length of 
nucleic acid sequence D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid 

sequence identity of D to C. 

In other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an 
active PRO polypeptide and which are capable of hybridizing, preferably under stringent hybridization and 
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wash conditions, to nucleotide sequences encoding a full-length PRO polypeptide as disclosed herein. PRO 
variant polypeptides may be those that are encoded by a PRO variant polynucleotide. 

"Isolated," when used to describe the various polypeptides disclosed herein, means polypeptide that 
has been identified and separated and/or recovered from a component of its natural environment. 
5 Contaminant components of its natural environment are materials that would typically interfere with 
diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other 
proteinaceous or non-proteinaceous solutes. In preferred embodiments, the polypeptide will be purified (1) 
to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a 
spinning cup sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions 

10 using Coomassie blue or, preferably, silver stain. Isolated polypeptide includes polypeptide in situ within 
recombinant cells, since at least one component of the PRO polypeptide natural environment will not be 
present. Ordinarily, however, isolated polypeptide will be prepared by at least one purification step. 

An "isolated" PRO polypeptide-encoding nucleic acid or other polypeptide-encoding nucleic acid is 
a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule 

15 with which it is ordinarily associated in the natural source of the polypeptide-encoding nucleic acid. An 
isolated polypeptide-encoding nucleic acid molecule is other than in the form or setting in which it is found 
in nature. Isolated polypeptide-encoding nucleic acid molecules therefore are distinguished from the 
specific polypeptide-encoding nucleic acid molecule as it exists in natural cells. However, an isolated 
polypeptide-encoding nucleic acid molecule includes polypeptide-encoding nucleic acid molecules 

20 contained in cells that ordinarily express the polypeptide where, for example, the nucleic acid molecule is in 
a chromosomal location different from that of natural cells. 

The term "conuol sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a particular host organism. The control sequences that are suitable for 
prokaryotes, for example, include a promoter, optionally an operator sequence, and a ribosome binding site. 

25 Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another 
nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA 
for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a 
promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; 

30 or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being linked are contiguous, and, 
in the case of a secretory leader, contiguous and in reading phase. However, enhancers do not have to be 
contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, the 
synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice. 

35 The term "antibody" is used in the broadest sense and specifically covers, for example, single anti- 

PRO monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PRO antibody 
compositions with polyepitopic specificity, single chain anti-PRO antibodies, and fragments of anti-PRO 
antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained from 
a population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the 
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population are identical except for possible naturally-occurring mutations that may be present in minor 
amounts. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, 
and generally is an empirical calculation dependent upon probe length, washing temperature, and salt 

5 concentration. In general, longer probes require higher temperatures for proper annealing, while shorter 
probes need lower temperatures. Hybridization generally depends on the ability of denatured DNA to 
reanneal when complementary strands are present in an environment below their melting temperature. The 
higher the degree of desired homology between the probe and hybridizable sequence, the higher the relative 
temperature which can be used. As a result, it follows that higher relative temperatures would tend to make 

10 the reaction conditions more stringent, while lower temperatures less so. For additional details and 
explanation of stringency of hybridization reactions, see Ausubel et al., Current Protocols in Mo lecular 
Biology , Wiley Interscience Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, may be identified by 
those that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium 

15 chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a 
denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum 
albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 6.5 with 750 mM 
sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 
0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's 

20 solution, sonicated salmon sperm DNA (50 u.g/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with 
washes at 42°C in 0.2 x SSC (sodium chloride/sodium citrate) and 50% formamide at 55°C, followed by a 
high-stringency wash consisting of 0.1 x SSC containing EDTA at 55°C 

"Moderately stringent conditions" may be identified as described by Sambrook et al, Molecular 
Cloning: A Laboratory Manual , New York: Cold Spring Harbor Press, 1989, and include the use of washing 

25 solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 
described above. An example of moderately stringent conditions is overnight incubation at 37°C in a 
solution comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon 
sperm DNA, followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize 

30 how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length 
and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope 
against which an antibody can be made, yet is short enough such that it does not interfere with activity of the 
35 polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody 
does not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six 
amino acid residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 
and 20 amino acid residues). 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
40 binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 
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constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
5 ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 
immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-l and IgA-2), IgE, 
IgD or IgM. 

"Active" or "activity" for the purposes herein refers to form(s) of a PRO polypeptide which retain a 
biological and/or an immunological activity of native or naturally-occurring PRO, wherein "biological" 
10 activity refers to a biological function (either inhibitory or stimulatory) caused by a native or naturally- 
occurring PRO other than the ability to induce the production of an antibody against an antigenic epitope 
possessed by a native or naturally-occurring PRO and an "immunological" activity refers to the ability to 
induce the production of an antibody against an antigenic epitope possessed by a native or naturally- 
occurring PRO. 

15 The term "antagonist" is used in the broadest sense, and includes any molecule that partially or 

fully blocks, inhibits, or neutralizes a biological activity of a native PRO polypeptide disclosed herein. In a 
similar manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a 
biological activity of a native PRO polypeptide disclosed herein. Suitable agonist or antagonist molecules 
specifically include agonist or antagonist antibodies or antibody fragments, fragments or amino acid 

20 sequence variants of native PRO polypeptides, peptides, antisense oligonucleotides, small organic 
molecules, etc. Methods for identifying agonists or antagonists of a PRO polypeptide may comprise 
contacting a PRO polypeptide with a candidate agonist or antagonist molecule and measuring a detectable 
change in one or more biological activities normally associated with the PRO polypeptide. 

"Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, 

25 wherein the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those 
in need of treatment include those already with the disorder as well as those prone to have the disorder or 
those in whom the disorder is to be prevented. 

"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed 
to an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 

30 "Intermittent" administration is treatment that is not consecutively done without interruption, but rather is 
cyclic in nature. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including 
humans, domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, 
pigs, goats, rabbits, etc. Preferably, the mammal is human. 
35 Administration "in combination with" one or more further therapeutic agents includes simultaneous 

(concurrent) and consecutive administration in any order. 

"Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers 
which are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations 
employed. Often the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of 
40 physiologically acceptable carriers include buffers such as phosphate, citrate, and other organic acids; 

23 



WO 2004/081199 



PCT7US2004/007862 



antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; 
proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or lysine; 
monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating 

5 agents such as EDTA; sugar alcohols such as mannitoi or sorbitol; salt-forming counterions such as sodium; 
and/or nonionic surfactants such as TWEEN™, polyethylene glycol (PEG), and PLURONICS™. 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or 
variable region of the intact antibody. Examples of antibody fragments include Fab, Fab', F(ab') 2 , and Fv 
fragments; diabodies; linear antibodies (Zapata et al, Protein Eng. 8(10): 1057-1062 [1995]); single-chain 

10 antibody molecules; and multispecific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 
fragments, each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the 
ability to crystallize readily. Pepsin treatment yields a^ F(ab'h fragment that has two antigen-combining 
sites and is still capable of cross-linking antigen. 

15 "Fv" is the minimum antibody fragment which contains a complete antigen-recognition and - 

binding site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non- 
covalent association. It is in this configuration that the three CDRs of each variable domain interact to 
define an antigen-binding site on the surface of the V H -V L dimer. Collectively, the six CDRs confer antigen- 
binding specificity to the antibody. However, even a single variable domain (or half of an Fv comprising 

20 only three CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower 
affinity than the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
(CHI) of the heavy chain. Fab fragments differ from Fab' fragments by the addition of a few residues at the 
carboxy terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge 

25 region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains 
bear a free thiol group. F(ab')2 antibody fragments originally were produced as pairs of Fab' fragments 
which have hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to 
one of two clearly distinct types, called kappa and lambda, based on the amino acid sequences of their 

30 constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: 
IgA, IgD, IgE, IgG, and IgM, and several of these may be further divided into subclasses (isotypes), e.g., 
IgGl, IgG2, IgG3, IgG4, IgA, and IgA2. 
35 "Single-chain Fv" or "sFv" antibody fragments comprise the V H and V L domains of antibody, 

wherein these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further 
comprises a polypeptide linker between the V H and V L domains which enables the sFv to form the desired 
structure for antigen binding. For a review of sFv, see Pluckthun in The Pharma cology of Monoclonal 
Antibodies , vol. 113, Rosenburg and Moore eds., Springer-Verlag, New York, pp. 269-315 (1994). 
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The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which 
fragments comprise a heavy-chain variable domain (V H ) connected to a light-chain variable domain (V L ) in 
the same polypeptide chain (V h -Vl). By using a linker that is too short to allow pairing between the two 
domains on the same chain, the domains are forced to pair with the complementary domains of another chain 
5 and create two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 
93/1 1 161 ; and Hollinger et al„ Proc. Natl. Acad. Sci. USA , 90:6444-6448 (1993). 

An "isolated" antibody is one which has been identified and separated and/or recovered from a 
component of its natural environment. Contaminant components of its natural environment are materials 
which would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, 
10 hormones, and other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody 
will be purified (1) to greater than 95% by weight of antibody as determined by the Lowry method, and most 
preferably more than 99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or 

i 

internal amino acid sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE 1 
under reducing or nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated 

15 antibody includes the antibody in situ within recombinant cells since at least one component of the 
antibody's natural environment will not be present. Ordinarily, however, isolated antibody will be prepared 
by at least one purification step. 

An antibody that "specifically binds to" or is "specific for" a particular polypeptide or an epitope on 
a particular polypeptide is one that binds to that particular polypeptide or epitope on a particular polypeptide 

20 without substantially binding to any other polypeptide or polypeptide epitope. 

The word "label" when used herein refers to a detectable compound or composition which is 
conjugated directly or indirectly to the antibody so as to generate a "labeled" antibody. The label may be 
detectable by itself (e.g. radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may 
catalyze chemical alteration of a substrate compound or composition which is detectable. 

25 By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can 

adhere. Examples of solid phases encompassed herein include diose formed partially or entirely of glass 
(e.g., controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol 
and silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an 
assay plate; in others it is a purification column (e.g., an affinity chromatography column). This term also 

30 includes a discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 
4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or 
surfactant which is useful for delivery of a drug (such as a PRO polypeptide or antibody thereto) to a 
mammal. The components of the liposome are commonly arranged in a bilayer formation, similar to the 
35 lipid arrangement of biological membranes. 

A "small molecule" is defined herein to have a molecular weight below about 500 Daltons. 

The term "immune related disease" means a disease in which a component of the immune system of 
a mammal causes, mediates or otherwise contributes to a morbidity in the mammal. Also included are 
diseases in which stimulation or intervention of the immune response has an ameliorative effect on 
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progression of the disease. Included within this term are immune-mediated inflammatory diseases, non- 
immune-mediated inflammatory diseases, infectious diseases, immunodeficiency diseases, neoplasia, etc. 

The term 'T cell mediated disease" means a disease in which T cells directly or indirectly mediate 
or otherwise contribute to a morbidity in a mammal. The T cell mediated disease may be associated with 
5 cell mediated effects, lymphokine mediated effects, etc. y and even effects associated with B cells if the B 
cells are stimulated, for example, by the lymphokines secreted by T cells. 

As used herein the term "psoriasis" is defined as a condition characterized by the eruption of 
circumscribed, discreet and confluent, reddish, silvery-scaled macropapules preeminently on the elbows, 
knees, scalp and trunk. 

10 As used herein the term "Inflammatory bowel disease" is defined as inflammatory disorders in 

which the intestine (bowel) becomes inflamed, often causing recurring cramps or diarrhea. 

The term "effective amount" is a concentration or amount of a PRO polypeptide and/or 
agonist/antagonist which results in achieving a particular stated purpose. An "effective amount" of a PRO \ 
polypeptide or agonist or antagonist thereof may be determined empirically. Furthermore, a "therapeutically 

15 effective amount" is a concentration or amount of a PRO polypeptide and/or agonist/antagonist which is 
effective for achieving a stated therapeutic effect. This amount may also be determined empirically. 

The term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function 
of cells and/or causes destruction of cells. The term is intended to include radioactive isotopes {e.g., I , 
I 125 , Y 90 and Re 186 ), chemotherapeutic agents, and toxins such as enzymatically active toxins of bacterial, 

20 fungal, plant or animal origin, or fragments thereof. 

A "chemotherapeutic agent" is a chemical compound useful in the treatment of cancer. Examples 
of chemotherapeutic agents include adriamycin, doxorubicin, epirubicin, 5-fluorouracil, cytosine arabinoside 
("Ara-C"), cyclophosphamide, thiotepa, busulfan, cytoxin, taxoids, e.g., paclitaxel (Taxol, Bristol-Myers 
Squibb Oncology, Princeton, NJ), and doxetaxel (Taxotere, Rhone-Poulenc Rorer, Antony, France), 

25 toxotere, methotrexate, cisplatin, melphalan, vinblastine, bleomycin, etoposide, ifosfamide, mitomycin C, 
mitoxantrone, vincristine, vinorelbine, carboplatin, teniposide, daunomycin, carminomycin, aminopterin, 
dactinomycin, mitomycins, esperamicins (see U.S. Pat. No. 4,675,187), melphalan and other related nitrogen 
mustards. Also,included in this definition are hormonal agents that act to regulate or inhibit hormone action 
on tumors such as tamoxifen and onapristone. 

30 A "growth inhibitory agent" when used herein refers to a compound or composition which inhibits 

growth of a cell, especially cancer cell overexpressing any of the genes identified herein, either in vitro or in 
vivo. Thus, the growth inhibitory agent is one which significantly reduces the percentage of cells 
overexpressing such genes in S phase. Examples of growth inhibitory agents include agents that block cell 
cycle progression (at a place other than S phase), such as agents that induce Gl arrest and M-phase arrest. 

35 Classical M-phase blockers include the vincas (vincristine and vinblastine), taxol, and topo II inhibitors such 
as doxorubicin, epirubicin, daunorubicin, etoposide, and bleomycin. Those agents that arrest Gl also spill 
over into S-phase arrest, for example, DNA alkylating agents such as tamoxifen, prednisone, dacarbazine, 
mechlorethamine, cisplatin, methotrexate, 5-fluorouracil, and ara-C. Further information can be found in 
The Molecular Basis of Cancer, Mendelsohn and Israel, eds., Chapter 1, entitled "Cell cycle regulation, 

40 oncogens, and antineoplastic drugs" by Murakami et al (WB Saunders: Philadelphia, 1995), especially p. 
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13. 

The term "cytokine" is a generic term for proteins released by one cell population which act on 
another cell as intercellular mediators. Examples of such cytokines are lymphokines, monokines, and 
traditional polypeptide hormones. Included among the cytokines are growth hormone such as human growth 

5 hormone, N-methionyl human growth hormone, and bovine growth hormone; parathyroid hormone; 
thyroxine; insulin; proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating 
hormone (FSH), thyroid stimulating hormone (TSH), and luteinizing hormone (LH); hepatic growth factor; 
fibroblast growth factor; prolactin; placental lactogen; tumor necrosis factor-oc and -(3; mullerian-inhibiting 
substance; mouse gonadotropin-associated peptide; inhibin; activin; vascular endothelial growth factor; 

10 integrin; thrombopoietin (TPO); nerve growth factors such as NGF-|5; platelet-growth factor; transforming 
growth factors (TGFs) such as TGF-cc and TGF-P; insulin-like growth factor-I and -II; erythropoietin (EPO); 
osteoinductive factors; interferons such as interferon-a, -p, and -y; colony stimulating factors (CSFs) such as 

^ macrophage-CSF (M-CSF); granulocyte-macrophage-CSF (GM-CSF); and granulocyte-CSF (G-CSF); 
interleukins (ILs) such as IL-1, IL-la, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-U, IL-12; a tumor 

15 necrosis factor such as TNF-a or TNF-P; and other polypeptide factors including LTF and kit ligand (KL). 
As used herein, the term cytokine includes proteins from natural sources or from recombinant cell culture 
and biologically active equivalents of the native sequence cytokines. 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 

20 constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any 

25 immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, 
IgD or IgM. 

As used herein, the term "inflammatory cells" designates cells that enhance the inflammatory 
response such as mononuclear cells, eosinophils, macrophages, and polymorphonuclear neutrophils (PMN). 
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Table 1 



10 



* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is Jvt; stop-stop = 0; J Q'okcr) match = 0 
*/ 

#define _M -8 /* value of a match with a stop */ 



int _day[26][26] = { 

/* ABCDEFGHIJKLMNOPQRSTUVWXYZ*/ 

15 /* A */ { 2, 0,-2, 0, 0,-4, 1,-1,-1, 0,-1,-2,-1, 0,_M, 1, 0,-2, 1,1,0, 0,-6, 0,-3, 0}, 

/* B */ { 0, 3,-4, 3, 2,-5, 0, 1,-2, 0, 0,-3,-2, 2,_M,-1, 1, 0, 0, 0, 0,-2,-5, 0,-3, 1}, 

/* C */ {-2,-4,15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5,-4,Jvl,-3,-5,-4, 0,-2, 0,-2,-8, 0, 0,-5}, 

/* D */ { 0, 3,-5, 4, 3,-6, 1, 1,-2, 0, 0,-4,-3, 2,_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 2}, 

/* E */ { 0, 2,-5, 3, 4,-5, 0, 1,-2, 0, 0,-3,-2, 1«_M,-1, 2,-1, 0, 0, 0,-2,-7, 0,-4, 3}, 

20 /* F */ {-4,-5,-4,-6,-5. 9,-5,-2, 1, 0,-5, 2, 0,-4_M,-5,-5,-4,-3,-3, 0,-1 , 0, 0, 7,-5}, 

/* G */ { 1, 0,-3, 1, 0,-5, 5,-2,-3, 0,-2,-4,-3, 0,_M,-l,-l,-3, 1, 0, 0,-1,-7, 0,-5, 0}, 

/* H */ {-1, 1,-3, 1, 1,-2,-2, 6,-2, 0, 0,-2,-2, 2,_M, 0, 3, 2,-1,-1, 0,-2,-3, 0, 0, 2}, 

/* I */ {-1,-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2, 2,-2,_M,-2,-2,-2,-l, 0, 0, 4,-5, 0,-1,-2}, 

/* J */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

25 /* K */ {-1, 0,-5, 0, 0,-5,-2, 0,-2, 0, 5,-3, 0, 1,_M,-1, 1, 3, 0, 0, 0,-2,-3, 0,-4, 0}, 

/* L */ {-2,-3,-6,-4,-3, 2,-4,-2, 2, 0,-3, 6, 4,-3_M,-3,-2,-3,-3,-l, 0, 2,-2, 0,-1,-2}, 

/* M */ {-1,-2,-5,-3,-2, 0,-3,-2, 2, 0, 0, 4, 6,-2,_M,-2,-l, 0,-2,-1 , 0, 2,-4, 0,-2,-1}, 

/* N */ { 0, 2,-4, 2, 1,-4, 0, 2,-2, 0, 1,-3,-2, 2,_M,-1, 1, 0, 1, 0, 0,-2,-4, 0,-2, 1}, 

/* O */ {_M,„M,_M,_M,34,_M,_M,_M,_M,_M,_M,_M,_M,„M, 0,_M,_M,^M^M,„M^M,_M,_M,^M,_M,_M}, 

30 /* P */ { 1,-1,-3,-1,-1,-5,-1, 0,-2, 0,-l,-3,-2,-l,_M, 6, 0, 0, 1, 0, 0,-1,-6, 0,-5, 0}, 

/*Q*/ { 0, 1,-5, 2, 2,-5,-1, 3,-2, 0, 1,-2,-1, 1,_M, 0, 4, 1,-1,-1, 0,-2,-5, 0,-4, 3}, 

/* R */ {-2, 0,-4,-1 ,-1 ,-4,-3, 2,-2, 0, 3,-3, 0, 0_M, 0, 1, 6, 0,-1, 0,-2, 2, 0,-4, 0}, 

/* S */ { 1, 0, 0, 0, 0,-3, 1,-1,-1,0, 0,-3,-2, 1,_M, 1,-1, 0, 2, 1, 0,-1,-2, 0,-3, 0}, 

/* T */ { 1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,_M, 0,-1,-1, 1, 3, 0, 0,-5, 0,-3, 0}, 

35 /* U */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,_M, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

/* V */ { 0,-2,-2,-2,-2,-1,-1,-2, 4, 0,-2, 2, 2,-2_M,-l, -2,-2,-1, 0, 0, 4,-6, 0,-2,-2}, 

/* W */ {-6,-5,-8,-7,-7, 0,-7,-3,-5, 0,-3, -2,-4,-4,_M ,-6,-5, 2,-2,-5, 0,-6,17, 0, 0,-6}, 

/* X */ { 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,Jvl, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 

/* Y */ {-3,-3, 0,-4,-4, 7,-5, 0,-1, 0,-4,- l,-2,-2,_M, -5, -4,-4,-3, -3, 0,-2, 0, 0,10,-4}, 

40 /* Z */ { 0, 1,-5, 2, 3,-5, 0, 2,-2, 0, 0,-2,-1, 1,__M, 0, 3, 0, 0, 0, 0,-2,-6, 0,-4, 4} 

}; 
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/* 
*/ 



Table 1 (conf) 



ffinclude <stdio.h> 




#include <ctype.h> 




#define MAXJMP 


16 


tfdefine MAXGAP 


24 


#define JMPS 


1024 


#define MX 


4 


#define DMAT 


3 


#define DMIS 


0 


#define DINSO 


8 


#define DINS1 


1 


#define PINSO 


8 


#define PINS1 


4 


struct jmp { 





}■■ 



short 

unsigned short 



struct diag { 
int 
long 
short 
struct jmp 

}; 



/* max jumps in a diag */ 

/* don't continue to penalize gaps larger than this */ 
/* max jmps in an path */ 

/* save if there's at least MX-1 bases since last jmp */ 

/* value of matching bases */ 

/* penalty for mismatched bases */ 

/* penalty for a gap */ 

/* penalty per base */ 

/* penalty for a gap */ 

/* penalty per residue */ 



n[MAXJMP]; 
xfMAXJMP]; 



score; 
offset; 
ijmp; 

jp; 



/* size of jn:D (neg for dely) */ 
/* base no. of jmp in seq x */ 
/* limits seq to 2 A 16 -1 */ 



/* score at last jmp */ 
/* offset of prev block*/ 
/* current jmp index */ 
/* list of jmps */ 



struct path { 

int spc; /* number of leading spaces */ 

short n[JMPS];/*sizeofjmp(gap)*/ 



int x[JMPS]; /* loc of jmp (last elem before gap) */ 

}; 

char *ofile; /* output file name */ 

c fl ar *namex[2); /* seq names: getseqsQ */ 

char *prog; /* prog name for err msgs */ 

char *seqx[2]; /* seqs: getseqs() */ 

int dmax; /* best diag: nwO */ 

int dmaxO; /* final diag */ 

int dna; /* set if dna: mainQ */ 

int endgaps; /* set if penalizing end gaps */ 

int gapx, gapy; /* total gaps in seqs */ 

int lenO.lenl; /* seq lens*/ 

int ngapx, ngapy; /* total size of gaps */ 

int smax; /* max score: nw() */ 

int *xbm; /* bitmap for matching */ 

!on S offset; /* current offset in jmp file */ 

struct diag *dx; /* holds diagonals */ 

struct path pp[2]; /* holds path for seqs */ 

char *calloc(), *malloc0, *index(), *strcpy(); 

char *getseq0, *g_calloc0; 
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Table 1 (conV) 

/* Needleman-Wunsch alignment program 



* usage: progs filel file2 

5 * where filel and file2 are two dna or two protein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with V, *>' or are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 
10 * Output is in the file "align.out" 

* The program may create a tmp file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
*/ 

15 #include "nw.h' 
include "day.h* 



static _dbval[26] = { 

1,14,2,13,0,0,4,11,0,0,12,0,3,15,0,0,0,5,6,8,8,7,9,0,10,0 

20 }; \ 

static _pbval[26] = { 

1, ^(l^CD'-'A^la^CN'-'A 1 )), 4, 8, 16, 32, 64, 
128, 256, OxFFFFFFF, 1«10, 1«11, 1«12, 1«13, 1«14, 
25 1«15, 1«16, 1«17, 1«18, 1«19, 1«20, 1«21, 1«22, 

1«23, 1«24, l^SRl^K-WttKl^CQ'-'A')) 

}; 

main(ac, av) 

30 main 



{ 



int ac; 
char *av[ ]; 



prog = av[0]; 
35 if(ac!=3){ 

fprintf(stderr "usage: %s filel file2\n", prog); 

fprintf(stderr,"where filel and file2 are two dna or two protein sequences An"); 
fprintf(stderr,"The sequences can be in upper- or lower-case\n"); 
fprintf(stderr "Any lines beginning with V or *<' are ignoredW); 
40 fprintf(stderr,"Output is in the file \"align.out\"\n"); 

exit(l); 

} 

namex[0] = av[l]; 
namex[l] = av[21; 
45 seqx[0] = getseq(namex[0], &len0); 

seqxfl] =getseq(namex[l], &lenl); 
xbm = (dna)? .dbval : _pbval; 

endgaps = 0; /* 1 to penalize endgaps */ 

50 ofile = "align.out"; /* output file */ 

nwO; /* fill in the matrix, get the possible jmps */ 

readjmpsO; /* get the actual jmps */ 

printQ; /* print stats, alignment */ 



cleanup(O); /* unlink any tmp files */ 



} 
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Table 1 (cont ? ) 

/* do the alignment, return best score: main() 

* dna: values in Fitch and Smith, PNAS, 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 
•7 

nw() 

nw 

/* seqs and ptrs */ 
/* keep track of dely */ 
/* keep track of delx */ 
/* for swapping rowO, row I */ 
/* score for each type */ 
/* insertion penalties */ 
/* diagonal index */ 

/*jmp index */ i 
/* score for curr, last row */ ^ 
/* index into seqs */ 

dx - (struct diag *)g_calloc("to get diags", len0+lenl+l, sizeof(struct diag)); 

25 ndely = (int *)g_calloc( u to get ndely", lenl+1, sizeof(int)); 

dely = (int *)g_calioc("to get dely", lenl+1 , sizeof(int)); 

colO = (int *)g_calloc("to get col0'\ lenl+1, sizeof(int)); 

coll = (int *)g_calloc("to get coll", lenl+1, sizeof(int)); 

insO = (dna)? DINS0 : PINS0; 
30 insl = (dna)? D1NS1 : P1NS1; 

smax = -10000; 
if(endgaps){ 

for (col0[0] = dely[0] = -insO, yy = 1 ; yy <= lenl; yy++) { 
35 col0[yy] = delylyy] = col0[yy-l] - insl; 

ndely[yy] = yy; 

col0[0] = 0; /* Waterman Bull Math Biol 84 */ 



to 


nw 




{ 




*px, *py; 




char 




int 


*ndely, *dely; 




int 


ndelx, delx; 


15 


int 


*tmp; 




int 


mis; 




int 


insO, insl; 




register 


id; 




register 


* * 

u; 

*col0, *coil; 


20 


register 




register 


xx, yy; 



} 

40 else 



for (yy = 1; yy <= lenl; yy++) 
dely[yy] = -insO; 



/* fill in match matrix 

45 */ 

for (px = seqx[0]; xx = 1; xx <= ienO; px++, xx++) { 

/* initialize first entry in col 
*/ 

if (endgaps) { 
50 if(xx=l) 

coll[0] = delx = -(insO+insl); 

else 

coll[0] = delx = col0[0] - insl; 
ndelx = xx; 



55 } 

else{ 



coll[0J=0; 
delx = -insO; 
ndelx = 0; 



60 } 
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Table 1 (conO 

...ivw 

for (py = seqx[l], yy = l; yy <= tenl; py++, yy++) { 
mis = colO[yy-l]; 
if(dna) 

mis += (xbm[*px- , Al&xbm[ !ls py- , A r l)? DMAT : DMIS; 

else 

mis +=_day[*px-'A'K*py-'A']; 

/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps || ndely[yy] < MAXGAP) { 

if (col0[yyj - insO >= dely[yy]) { 

dely[yy] = col0[yy] - (insO+insl); 
ndely[yy] = 1; 

} else { 

dely[yyl -=insl; 
ndely[yy]++; 

} • 

} else { 

if (colOfyy] - (insO+insl) >= dely[yy]) { 

dely[yy] = col0[yy] - (insO+insl); 
ndely[yy] = i; 

} else 

ndely[yy]-H-; 

} 

/* update penalty for del in y seq; 

* favor new del over ongong del 
*/ 

if (endgaps || ndelx < MAXGAP) { 

if(coll[yy-l]-ins0>=delx){ 

delx = coll[yy-l] - (insO+insl); 
ndelx = 1 ; 

} else { 

delx -=insl; 
ndelx++; 

} 

} else { 

if (coll[yy-l] - (insO+insl) >= delx) { 

delx = coll[yy-l] - (insO+insl); 
ndelx = 1; 

} else 

ndelx++; 

} 

/* pick the maximum score; we're favoring 

* mis over any del and delx over dely 
*/ 
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Table 1 (cont>) 

...n\v 

id = xx - yy + lenl - 1; 
if (mis >= delx && mis >= dely[yy]) 
5 coll[yy] = mis; 

else if (delx >= dely[yy]) { 

coll[yy] = delx; 

ij = dx[id].ijmp; 

if (dxlid].jp.n[0| && (!dna || (ndelx >= MAXJMP 
10 && xx > dx[id].jp.x[ij]+MX) || mis > dx[id].score+DlNSO)) { 

dx[id].ijmp++; 
if (++ij >= MAXJMP) { 
writejmps(id); 
ij = dx[id].ijmp = 0; 

15 dx[id].offset = offset; 

offset += siseof(struct jmp) + sizeof(offset); 

} 

} 

dx[id].jp.n[ij] = ndelx; 
20 dx[id]jp.x[ij] = xx; 

dx[id]. score = delx; 

} 

else { 

coll[yy] = dely[yy]; 
25 ij = dx[id].ijmp; 

if (dx[id].jp.n[0] && (!dna || (ndely[yy] >= MAXJMP 

&& xx > dx[id].jp.x[ij]+MX) || mis > dx[id].score+DINSO)) { 

dx[id].ijmp++; 

if (++ij >= MAXJMP) { 

30 writejmps(id); 

ij = dx[id].ijmp = 0; 
dx[id]. offset = offset; 

offset += sizeof(struct jmp) + sizeof(offset); 

} 

35 } 

dx[id].jp.n[ij] = -ndely[yy]; 

dx[id].jp.x[ij] = xx; 

dxfid]. score = dely[yy]; 

} 

40 if (xx == lenO && yy < lenl) { 

/* last col 
*/ 

if (endgaps) 

coll[yy] -= ins0+insl*(lenl-yy); 
45 if (coll[yy] >smax) { 

smax = coll[yy]; 
dmax - id; 

} 

} 

50 } 

if (endgaps && xx < lenO) 

coll[yy-l] -=ins0+insl*(len0-xx); 
if (coll[yy-l]>smax){ 

smax = coll[yy-l]; 
55 dmax = id; 

} 

tmp = colO; colO = coll; coll = tmp; 

} 

(void) free((char :;: )ndely); 
60 (void) free((char *)dely); 

(void) free((char ^JcoIO); 

(void) free((char s:5 )coll): } 
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Table 1 (conV) 

/* 

* printQ - only routine visible outside this module 

* 

* static: 

* getmatQ - trace back best path, count matches: printO 

* pr_align() - print alignment of described in array p[ ]: print() 

" :: dumpblockO - dump a block of lines with numbers, stars: pr„align() 

* numsO - put out a number line: dumpblockO 

* putlineO -- put out a line (name, [num], seq, [num]): dumpblockQ 

* starsQ - -put a line of stars: dumpblockO 

* stripnameO - strip any path and prefix from a seqname 
*/ 

include "nw.h" 
#define SPC 3 

tfdefine P_LINE 256 /* maximum output line */ 

#define P_SPC 3 /* space between name or num and seq */ 

extern _day[26][26]; 

int olen; /* set output line length */ 

FILE *fx; /* output file */ 

print() 



{ 



} 



print 

int Ix, ly, firstgap, Iastgap; /* overlap */ 

if ((fx = fopen(ofile, "w")) == 0) { 

fprintf(stderr "%s: can't write %s\n", prog, ofile); 
cleanup(l); 

} 

fprintf(fx, "<first sequence: %s (length = %d)\n'\ namex[0], lenO); 
fprintf(fx, "<second sequence: %s (length = %d)\n", namex[l], lenl); 
olen = 60; 
lx = lenO; 
ly = lenl; 

firstgap = Iastgap = 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 

pp[0].spc = firstgap = lenl - dmax - 1 ; 
ly -= pp[0].spc; 

} 

else if (dmax > lenl - 1) { /* leading gap in y */ 
pp[l].spc = firstgap = dmax - (lenl - 1); 
lx -= pp[l].spc; 

} 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 
Iastgap = lenO - dmaxO -1; 
lx -= Iastgap; 

} 

else if (dmaxO > lenO - 1) { /* trailing gap in y S! 7 
Iastgap = dmaxO - (lenO - 1); 
ly -= Iastgap; 

} 

getmat(lx, ly, firstgap, Iastgap); 
pr_align(); 
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/* 

* trace back the best path, count matches 

*/ 

static 



Table 1 (conn 



getmat(lx, ly, rlrstgap, Iastgap) getmat 
| nt I*, /* "core" (minus endgaps) */ 

int firstgap, Iastgap; /* leading trailing overlap */ 



{ 



>nt nm,iO,il, sizO,sizl; 

char outx[32]; 

double pet; 

register nO,nl; 

register char *p0, *pl; 

/* get total matches, score 

*/ 

iO = il = sizO = sizl = 0; 
pO = seqx[0]+pp[l].spc; 
pl=seqx[l]+pp[0].spc; 
nO = pp[l].spc+ 1; 
nl = pp[0].spc + 1; 

nm = 0; 

while ( *p0 && *pl ) { 
if (sizO) { 

pl++; 

nl++; 
sizO— ; 

} 

else if (sizl){ 
pO-H-; 



} 

else { 



} 



} 



n0++; 
sizl--; 



if(xbm[*p(VA]&xbm[*pl-'A*]) 
nm++; 

if(nO++ = pp[0].x[iO]) 

sizO = pp[0].n[iO++]; 

if(nl++ = pp[l]. x [il]) 

sizl = pp[l].n[il++]; 

p0++; 
pl++; 



/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 
*/ 

if (endgaps) 

lx = (lenO<lenl)?lenO:lenl; 

else 

lx = (lx < ly)? lx : ly; 

pct= 100.*(double)nm/(double)lx; 
fprintf(fx, V); 

fprintf(fx, "<%d match%s in an overlap of %d: %.2f percent similarity\n" 
nm,(nm= I)? "":"es", lx, pet); 
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Table 1 (conf) 

fprintf(rx, "<gaps in first sequence: %d'\ gapx); ...getmat 
if (gapx) { 

5 (void) sprintf(outx, " (%d %s%s)'\ 

ngapx, (dna)? "base": "residue", (ngapx = 1)? "":"s"); 
fprintf(fx,"%s", outx); 

fprintf(fx, *\ gaps in second sequence: %d", gapy); 
10 if (gapy) { 

(void) sprintf(outx, " (%d %s%s)", 

ngapy, (dna)? "base'Y'residue", (ngapy = 1)? ,M, :V); 
fprintf(fx,"%s", outx); 

} 

15 if (dna) 

fprintfCfx, 

"\n<score: %d (match = %d, mismatch = %d, gap penalty = %d + %d per base)\n", 
smax, DMAT, DMIS, DINSO, DINS1); 

else 

20 fprintf(fx, 

"\n<score: %d (Dayhoff PAM 250 matrix, gap penalty = %d + %d per residue)\n", 

smax,PINS0, PINS1); 

if (endgaps) 

fprintf(fx, 

25 "<endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 

firstgap, (dna)? "base" : "residue", (firstgap == 1)? "" : u s", 
lastgap, (dna)? "base" : "residue", (lastgap == 1)? '"' : "s"); 

else 

fprintf(fx, "<endgaps not penalized\n"); 

30 } 

static nm; /* matches in core -- for checking */ 

static lmax; /* lengths of stripped file names */ 

static ij[2]; /* jmp index for a path */ 

static nc[2j; /* number at start of current line */ 

35 static ni[2]; /* current elem number - for gapping */ 

static siz[2]; 

static char *ps[2]; /* ptr to current element */ 

static char *po[2] ; /* ptr to next output char slot */ 

static char out[2][P_LINE]; /* output line */ 

40 static char star[P_UNE] ; /* set by starsO */ 

/* 

* print alignment of described in struct path pp[ ] 
*/ 

45 static 

prjiigno pr.ahgn 

int nn; /* char count */ 

int more; 
50 register i; 

for (i = 0, lmax = 0; i < 2; i++) { 

nn = stripname(namex[i]); 
if (nn > lmax) 
55 lmax = nn; 

nc[i] = 1; 
ni[i] = 1; 
siz[i] = ij[i] = 0; 
60 ps[i] = seqx[i]; 

po[i] =out[i]; } 
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Table 1 fcont^ 

for (nn = nm = a more =1; more; ){ _ r „ 

for (i = more = 0; i < 2; i++) { ».pr_ailgn 

J ft- 

* do we have more of this sequence? 
if(!*ps[i]) 

jq continue; 

more++; 

■ 

if (PPtiJ.spc) { /* leading space */ 
15 *po[i]++ = "; 

pp[i].spc-s 

else if(siz[i]){ /* in a gap*/ 
*po[i]++=v ; 



20 



siz[i]~; 

} 

eke { /* we're putting a seq element 

*/ 

*po[i] » *p S [i]; 

25 if(islower(*ps[i])) 

*ps[i] = toupper(*ps[i]); 

po[i]++; 

ps[i]++; 



30 



/* 

* are we at next gap for this seq? 

*/ 

if Mi] == PP [i].x[ij[i]]) { 

/* 

35 * we need to merge all gaps 

* at this location 
*/ 

sizti] = pp[i].n[ij[i]++]; 
while (ni[i] = pp[i].x[ij[i]]) 
40 siz W += PPti].ntij[i]++]; 



50 



} 



} 



ni[i]++; 



} 

4 - if (++nn = olen || !more && nn) { 

dumpblock(); 
for(i = 0;i<2; i++) 

po[i] = out[i]; 

nn = 0; 

} 



/* 

55 */ dUmp 3 bI ° Ck 0f lines> incIudm S numbers, stars: pr_align() 
static 

dumpblock() 

dumpblock 

{ 

60 register i; 

for (i = 0; i < 2; i++) 

*po[i]- = \0\ 
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Table 1 (conf) 



10 



15 



(void) putc(\n\ fx); 
for (i = 0;i<2; i++) { 

if (*out[i] && (*out[i] != ' Ml *(po[i]) !=")){ 
if (i == 0) 

nums(i); 
if (i = 0 && *out[L]) 
starsO; 

putline(i); 

if(j = 0&& *out[l]) 

fprintf(fx, star); 

if(i = l) 

nums(i); 

} 

} 



...dumpblock 



20 /* 



25 



30 



35 



40 



45 



50 



\ 



* put out a number line: dumpblockO 
*/ 

static 

nums(ix) 

int ix; /* index in out[ ] holding seq line */ 

{ 

char nlinetPJUNE]; 

register i, j; 

register char *pn, *px, *py; 

for (pn = nline, i = 0; i < lmax+P_SPC; i++, pn++) 
*pn = ' '; 

for (i = nc[ix], py = out[ix]; *py; py++, pn++) { 
if (* py == * Ml *PY= V> 



else{ 



*pn - , 



if (i%10 == 0 || (i = 1 && nc[ix] != 1)) { 
j = (i<0)?-i:i; 
for (px = pn; j; j /= 10, px~) 
*px=j%lO + 1 0 , ; 

if(i<0) 

*px = -'; 



} 

else 
i++; 



*pn = ' 



} 



} 

*pn = \0'; 
nc[ix] = i; 

for (pn = nline; *pn; pn++) 

(void) putc(*pn, fx); 
(void) putc(\n\ fx); 



nums 



55 



/* 

* put out a line (name, [num], seq, [num]): dumpblockO 

*/ 

static 



60 putline(ix) 



int 



ix; 



{ 



putline 
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Table 1 fcont') 



int ,. ...putline 

register char *px; 

for (px = namex[ix), i = 0; *px && * px != V; px++, 

(void) putc(*px, fx); 
for (; i < lmax+P_SPC; 

(void) putc(' \ fx); 

/* these count from 1 : 

* ni[ ] is current element (from 1) 

* nc[ ] is number at start of current line 

*/ 

for (px = outpx]; *px; px++) 

(void) putc(*px&0x7F, fx); 
(void) putc(\n', fx); 

} 



/* 

* put a line of stars (seqs always in out[0], out[l]): dumpblock() 

static 

stars() 

stars 

{ 



int i; 

register char * p 0, *pl , cx, *px; 



(*out[0] == " && *(po[0]) = ' 0 || 
(*out[ !] = "&& *(po[l]) = '0) 



if(!*out[0] 
!*out[l] 

return; 
px = star; 

for (i = lmax+P_SPC; i; i~) 
*px++ = ' *; 

for (pO = out[0], pi = out[lJ; *p0 && *pl; p 0++, pl++) { 
if (isalpha(*pO) && isalpha(*pl)) { 

if (xbm[*pO-'Al&xbm[*pl-»A]) { 
cx = *'; 
nm++; 

} 

else if (Idna && ^day^pO-'AK^pl-'Al > 0) 
cx = V; 

else 

cx = 1 

} 

else 



cx = 



} 



> 

*px++ = \n'; 
*px = \0 y ; 



*px++ = cx; 
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Table 1 (conV) 

/* 

* strip path or prefix from pn, return len: pr_ahgn() 
*/ 

static 

stripname(pn) 

stripname 

char *pn; /* file name (may be path) */ 

{ 

register char *px, *py; 
py = 0; 

for (px = pn; *px; px++) 
if (*px == V) 

py = px + l; 

if (py) 

(void) strcpy(pn, py); 
return(strlen(pn)); 
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10 



Table 1 (conf) 



/* 

* cleanupO - cleanup any tmp file 

* getseqO - read in seq, set dna, len, raaxlen 

* g_calloc() - calloc() with error checkin 

* readjmps() ~ get the good jmps, from tmp file if necessary 

* wrilejmpsO - write a filled array of jmps to a tmp file: mv() 
*/ 

#include "nw.h" 
#include <sys/file.h> 



15 



char 
FILE 

int 
long 



*jname = 7tmp/homgXXXXXX'*; 
*fj; 



cleanupO; 
lseekQ; 



/* tmp File for jmps */ 



/* cleanup tmp file */ 



20 



25 



/* 

* remove any tmp file if we blow 

*/ 



cleanup (i) 
{ 



int 

if 09) 



i; 



exit(i); 



(void) unlink(jname); 



cleanup 



30 



35 



40 



45 



50 
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60 



/* 

* read, return ptr to seq, set dna, len, maxlen 

* skip lines starting with '<\ or '>' 

* seq in upper or lower case 
*/ 

char * 



getseq(file, len) 

char *file; 
int *len; 

{ 

char 

register char 

int 

FILE 



/* file name */ 
/* seq len */ 

line[1024], *pseq; 
*px, *py; 
natgc, tlen; 
*fp; 



getseq 



if ((fp = fopen(file,"r")) = 0) { 

fprintf(stderr,"%s: can't read %s\n'\ prog, file); 
exit(l); 

} 

tlen = natgc = 0; 

while (fgetsGine, 1024, fp)) { 

if (*line = ';■ || *line '<' || *line = V) 

continue; 
for (px = line; *px != An'; px++) 

if (isupper(*px) || islower(*px)) 
tlen++; 

} 

if ((pseq = malloc((unsigned)(tIen+6))) = 0) { 

fprintf(stderr,"%s: malIoc() failed to get %d bytes for %s\n", prog, tlen+6, file); 
exit(l); 

} 

pseq[0] = pseq[l] = pseq[2] = pseq[3] = \0'; 
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Table 1 (conV) 



•♦.getseq 



py = pseq + 4; 
*len = tlen; 
rewind(fp); 

while (fgets(linc, 1024, fp)) { 

if (*line = V |j *line = '<' || *Iine = 

continue; 
for (px = line; *px != W; px++) { 
if (isupper(*px)) 

*py++ = *px; 
else if (islower(*px)) 

*py++ = toupper(*px); 
if(index("ATGCU"*(py-l))) 
natgc++; 

} 

} 

*py = \0'; 
(void) fclose(fp); 
dna = natgc > (tlen/3); 
return(pseq+4); 



} 



char 



g_calloc(msg, nx, sz) 

char *msg; /* program, calling routine */ 



gcalloc 



{ 



int nx, sz; /* number and size of elements */ 

char *px, *calioc(); 

if ((px = calloc((unsigned)nx } (unsigned)sz)) == 0) { 
if(*msg){ 

fprintf(stderr, "%s: g_calloc0 failed %s (n=%d, sz=%d)\n", prog, msg, nx, sz); 
exit(l); 

} 

} 

return(px); 



} 



/* 

* get final jmps from dx[ ] or tmp file, set pp[ ], reset dmax: mainO 
*/ 

readjmpsO 

readjmps 

{ 

int fd = -l; 

int siz,iO,il; 
register i,j,xx; 

if(rj){ 

(void) fclose(fj); 

if ((fd = open(jname, 0_RDONLY, 0)) < 0) { 

fprintf(stderr, "%s: can't open() %s\n", prog, jname); 
cleanup(l); 

} 

for (i = iO = il = 0, dmaxO = dmax, xx = lenO; ; { 
while (1){ 

for 0' = dx[dmax].ijmp; j >= 0 && dx[dmax].jp.x[j] >= xx; j-) 
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Table 1 (cont 5 ) 

...readjmps 

if G < 0 && dx[dmaxj .offset && fj) { 

(void) lseek(fd, dx[dmax].offset, 0); 

(void) readffd, (char *)&dx[dmaxj.jp, sizeof(struct jmp)); 

(void) read(fd, (char *)&dx[dmax].offset, sizeof(dx[dmax]. offset)); 

dx[dmax].ijmp = MAX JMP- 1 ; 

} 

else 

break; 

} 

if (i >= JMPS) { 

fprintf(stderr, "%s: too many gaps in alignmentVT, prog); 
cleanup(l); 

} 

if (j>= 0){ 

siz = dx[dmax].jp.n[j]; 
xx = dx[dmax].jp.x[j]; 
dmax += siz; 

if (siz < 0) { /* gap in second seq */ 

PP[lJ.n[il]=-siz; 
xx += siz; 

/* id = xx - yy + lenl - 1 

*/ 

pp[l].x[il] = xx - dmax + lenl - 1; 
gapy-H-; 
ngapy -= siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (-siz < MAXGAP || endgaps)? -siz : MAXGAP; 
il++; 

} 

else if (siz > 0) { /* gap in first seq */ 
pp[0].n[i0] = siz; 
pp[0].x[iO] = xx; 
gapx++; 
ngapx += siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (siz < MAXGAP || endgaps)? siz : MAXGAP; 
i0++; 

} 

} 

else 

break; 

} 

/* reverse the order of jmps 
*/ 

for (j = 0, i0~; j < iO; j++, i0~) { 

i = pp[0].n[j]; pp[0].n[j] = pp[0].n[i0]; pp[0].n[i0] = i; 
^ i = Pp[0J.xBI; pp[0].x[j] = pp[0].x[i0]; pp[0].x[i0] = i; 

for(j=ail-;j<il;j++,il-){ 

i = PP[l]-n|j]; pp[l].nQ] = pp[l].n[il]; pp[l].n[il] = i; 
^ i = pp[l].x[j]; pp[l].x[j] = pp[l].x[il]; pp[l].x[il] = i; 

if (fd >= 0) 

(void) close(fd); 

it mi 

(void) unlink(jname); 
fj = 0; 
offset = 0; 

> ) 
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Table 1 (conf) 

/* 

* write a filled jmp struct offset of the prev one (if any): nwO 

5 */ 

writejmps(ix) 

writejmps 

int ix; 

{ 

10 char *mktemp(); 

if(Hj){ 

if(mktempGname)<0){ 

fprintf(stderr, "%s: can't mktempO %s\n", prog, jname); 

15 cleanup(l); 

if ((f] = fopenGname, V)) == 0) { 

fprintf(stderr, "%s: can't write %s\n", prog, jname); 

exit(l); 

20 s } 
v } 

(void) fwrite((char *)&dx[ix] jp, sizeof(struct jmp), 1 , fj); 
(void) fwrite((char *)&dx[ix].offset, sizeof(dx[ix] .offset), 1, fj); 

} 

25 
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Table 2 

5 PRO XXXXXXXXXXXXXXX (Length = 15 amino acids) 

Comparison Protein XXXXXYY Y YYYY (Length = 1 2 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as 
10 determined by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 
5 divided by 15 = 33.3% 

Table 3 

15 PRO XXXXXXXXXX (Length = 1 0 amino acids) 

Comparison Protein XXXXXYYYYYYZZYZ (Length = 15 amino acids) 

% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide sequences as 
20 determined by ALIGN-2) divided by (the total number of amino acid residues of the PRO polypeptide) = 

5 divided by 10 = 50% 

Table 4 

25 PRO-DNA NNr^nSfNNNNNNNNN (Length = 14 nucleotides) 

Comparison DNA NNNNNNLLLLLLLLLL (Length = 1 6 nucleotides) 

% nucleic acid sequence identity = 

30 (the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 

6 divided by 14 = 42.9% 

Table 5 

35 

PRO-DNA NNNNNNNNNNNN (Length = 1 2 nucleotides) 

Comparison DNA NNNNLLLVV (Length = 9 nucleotides) 

% nucleic acid sequence identity = 

40 
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(the number of identically matching nucleotides between the two nucleic acid sequences as determined by 
ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic acid sequence) = 
4 divided by 12 = 33.3% 

5 II. Compositions and Methods of the Invention 

A. Full-Length PRO Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding 
polypeptides referred to in the present application as PRO polypeptides. In particular, cDNAs encoding 
various PRO polypeptides have been identified and isolated, as disclosed in further detail in the Examples 
10 below. However, for sake of simplicity, in the present specification the protein encoded by the full length 
native nucleic acid molecules disclosed herein as well as all further native homologues and variants included 
in the foregoing definition of PRO, will be referred to as "PRO/number", regardless of their origin or mode 

of preparation. \ 

As disclosed in the Examples below, the sequence of various cDNA clones have been disclosed. 
15 The predicted amino acid sequence can be determined from the nucleotide sequence using routine skill. For 
the PRO polypeptides and encoding nucleic acids described herein, Applicants have identified what is 
believed to be the reading frame best identifiable with the sequence information available at the time. 

B. PRO Polypeptide Variants 

In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated 

20 that PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide 
changes into the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the art 
will appreciate that amino acid changes may alter post-translational processes of the PRO, such as changing 
the number or position of glycosylation sites or altering the membrane anchoring characteristics. 

Variations in the native full-length sequence PRO or in various domains of the PRO described 

25 herein, can be made, for example, using any of the techniques and guidelines for conservative and non- 
conservative mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a 
substitution, deletion or insertion of one or more codons encoding the PRO that results in a change in the 
amino acid sequence of the PRO as compared with the native sequence PRO. Optionally, the variation is by 
substitution of at least one amino acid with any other amino acid in one or more of the domains of the PRO. 

30 Guidance in determining which amino acid residue may be inserted, substituted or deleted without adversely 
affecting the desired activity may be found by comparing the sequence of the PRO with that of homologous 
known protein molecules and minimizing the number of amino acid sequence changes made in regions of 
high homology. Amino acid substitutions can be the result of replacing one amino acid with another amino 
acid having similar structural and/or chemical properties, such as the replacement of a leucine with a serine, 

35 i.e., conservative amino acid replacements. Insertions or deletions may optionally be in the range of about 1 
to 5 amino acids. The variation allowed may be determined by systematically making insertions, deletions 
or substitutions of amino acids in the sequence and testing the resulting variants for activity exhibited by the 
full-length or mature native sequence. 

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N- 

40 terminus or C-terminus, or may lack internal residues, for example, when compared with a full length native 
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protein. Certain fragments lack amino acid residues that are not essential for a desired biological activity of 
the PRO polypeptide. 

PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide 
fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by 
5 enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by 
particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the 
desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment 
encoding a desired polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define 
the desired termini of the DNA fragment are employed at the 5 ' and 3 ' primers in the PCR. Preferably, PRO 
10 polypeptide fragments share at least one biological and/or immunological activity with the native PRO 
polypeptide disclosed herein. 

In particular embodiments, conservative substitutions of interest are shown in Table 6 under the 
heading of preferred substitutions. If such substitutions result in a change in biological activity, then more 
substantial changes, denominated exemplary substitutions in Table 6, or as further described below in 
15 reference to amino acid classes, are introduced and the products screened. 

Table 6 





Original 
Residue 


Exemplary 
Substitutions 


Preferred 
Substitutions 


20 


Ala (A) 


val; leu; ile 


val 




Arg (R) 


lys; gin; asn 


lys 




Asn (N) 


gin; his; lys; arg 


gin 




Asp (D) 


glu 


glu 




Cys (C) 


ser 


ser 


25 


Gin (Q) 


asn 


asn 




Glu (E) 


asp 


asp 




Gly (G) 


pro; ala 


ala 




His (H) 


asn; gin; lys; arg 


arg 




lie (I) 


leu; val; met; ala; phe; 




30 




norleucine 


leu 




Leu (L) 


norleucine; ile; val; 








met; ala; phe 


ile 




Lys (K) 


arg; gin; asn 


arg 




Met (M) 


leu; phe; ile 


leu 


35 


Phe(F) 


leu; val; ile; ala; tyr 


leu 




Pro (P) 


ala 


ala 




Ser (S) 


thr 


thr 




Thr (T) 


ser 


ser 




Trp(W) 


tyr; phe 


tyr 


40 


Tyr(Y) 


trp; phe; thr; ser 


phe 




Val (V) 


ile; leu; met; phe; 








ala; norleucine 


leu 



Substantial modifications in function or immunological identity of the PRO polypeptide are 
45 accomplished by selecting substitutions that differ significantly in their effect on maintaining (a) the 
structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical 
conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side 
chain. Naturally occurring residues are divided into groups based on common side-chain properties: 
(1) hydrophobic: norleucine, met, ala, val, leu, ile; 
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(2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 
5 (6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for another 
class. Such substituted residues also may be introduced into the conservative substitution sites or, more 
preferably, into the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide- mediated (site- 
10 directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et aL, 
NucL Acids Res., 13:4331 (1986); Zoller et al., Nucl. Acids Res., 10:6487 (1987)], cassette mutagenesis 
[Wells et al., Gene , 34:315 (1985)], restriction selection mutagenesis [Wells et al., Philos. Trans. R. Soc. 
London SerA, 317:415 (1986)] or other known techniques can be perl ormed on the cloned DNA to produce 
the PRO variant DNA. 

15 Scanning amino acid analysis can also be employed to identify one or more amino acids along a 

contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. 
Such amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning 
amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to 
alter the main-chain conformation of the variant [Cunningham and Wells, Science , 244: 1081-1085 (1989)]. 

20 Alanine is also typically preferred because it is the most common amino acid. Further, it is frequently found 
in both buried and exposed positions [Creighton, The Proteins , (W.H. Freeman & Co., N.Y.); Chothia, I 
Mol. Biol. , 150 :1 (1976)]. If alanine substitution does not yield adequate amounts of variant, an isoteric 
amino acid can be used. 

C. Modifications of PRO 

25 Covalent modifications of PRO are included within the scope of this invention. One type of 

covalent modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic 
derivatizing agent that is capable of reacting with selected side chains or the N- or C- terminal residues of 
the PRO. Derivatization with bifunctional agents is useful, for instance, for crosslinking PRO to a water- 
insoluble support matrix or surface for use in the method for purifying anti-PRO antibodies, and vice-versa. 

30 Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, 
including disuccinimidyl esters such as 3,3 -dithiobis(succinimidylpropionate), bifunctional maleimides such 
as bis-N-maleimido-l,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to the 

35 corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, 
phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the a-amino groups of 
lysine, arginine, and histidine side chains [T.E. Creighton, Proteins: Structure and M olecular Properties, 
W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation 
of any C-terminal carboxyl group. 
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Another type of covalent modification of the PRO polypeptide included within the scope of this 
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native 
glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties 
found in native sequence PRO (either by removing the underlying glycosylation site or by deleting the 
5 glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are 
not present in the native sequence PRO. In addition, the phrase includes qualitative changes in the 
glycosylation of the native proteins, involving a change in the nature and proportions of the various 
carbohydrate moieties present. 

Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino 
10 acid sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more 
serine or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino 
acid sequence may optionally be altered through changes at the DNA level, particularly by mutating the 
DNA encoding the PRO polypeptide at preselected bases such that codons are generated that will translate 
into the desired amino acids. 

15 Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by 

chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, 
e.g., in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem. , 
pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished 
20 chemically or enzymatically or by mutational substitution of codons encoding for amino acid residues that 
serve as targets for glycosylation. Chemical deglycosylation techniques are known in the art and described, 
for instance, by Hakimuddin, et al., Arch. Biochem. Biophvs. , 259 :52 (1987) and by Edge et al., Anal. 
Biochem., 118:131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved 
by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. EnzvmoL 
25 138:350 (1987). 

Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a 
variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or 
polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 
4,791,192 or 4,179,337. 

30 The PRO of the present invention may also be modified in a way to form a chimeric molecule 

comprising PRO fused to another, heterologous polypeptide or amino acid sequence. 

In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag 
polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino- or carboxyl- terminus of the PRO. The presence of such epitope- tagged 

35 forms of the PRO can be detected using an antibody against the tag polypeptide. Also, provision of the 
epitope tag enables the PRO to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. Various tag polypeptides and their respective 
antibodies are well known in the art. Examples include poly-histidine (poly-his) or poly-histidine-glycine 
(poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol., 8:2159- 

40 2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., 
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Molecular and Cellular Biology . 5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD) 
tag and its antibody [Paborsky et al., Protein Engineering , 3(6):547-553 (1990)]. Other tag polypeptides 
include the Flag-peptide [Hopp et al., BioTechnology. 6:1204-1210 (1988)]; the KT3 epitope peptide 
[Martin et al., Science. 255:192-194 (1992)]; an alpha-tubulin epitope peptide [Skinner et al., J. Biol. Chem. . 
5 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyermuth et al., Proa Natl. Acad. 
Sci. USA . 82:6393-6397 (1990)]. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule 
(also referred to as an "immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The 

10 Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) 
form of a PRO polypeptide in place of at least one variable region within an Ig molecule. In a particularly 
preferred embodiment, the immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CHI, 
CH2 and CH3 regions of an IgG 1 molecule. For the production of immunoglobulin fusions see also US 
Patent No. 5,428,130 issued June 27, 1995. 

15 D. Preparation of PRO 

The description below relates primarily to production of PRO by culturing cells transformed or 
transfected with a vector containing PRO nucleic acid. It is, of course, contemplated that alternative 
methods, which are well known in the art, may be employed to prepare PRO. For instance, the PRO 
sequence, or portions thereof, may be produced by direct peptide synthesis using solid-phase techniques 

20 [see, e.g., Stewart et al., Solid-Phase Peptide Synthesis. W.H. Freeman Co., San Francisco, CA (1969); 
Merrifield, J. Am. Chem. Soc . 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using 
manual techniques or by automation. Automated synthesis may be accomplished, for instance, using an 
Applied Biosystems Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various 
portions of the PRO may be chemically synthesized separately and combined using chemical or enzymatic 

25 methods to produce the full-length PRO. 

1. Isolation of DNA Encoding PRO 
DNA encoding PRO may be obtained from a cDNA library prepared from tissue believed to 
possess the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be 
conveniently obtained from a cDNA library prepared from human tissue, such as described in the Examples. 

30 The PRO-encoding gene may also be obtained from a genomic library or by known synthetic procedures 
(e.g., automated nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at least 
about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the 
cDNA or genomic library with the selected probe may be conducted using standard procedures, such as 

35 described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor 
Laboratory Press, 1989). An alternative means to isolate the gene encoding PRO is to use PCR methodology 
[Sambrook et al., supra ; Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor 
Laboratory Press, 1995)]. 

The Examples below describe techniques for screening a cDNA library. The oligonucleotide 

40 sequences selected as probes should be of sufficient length and sufficiently unambiguous that false positives 
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are minimized. The oligonucleotide is preferably labeled such that it can be detected upon hybridization to 
DNA in the library being screened. Methods of labeling are well known in the art, and include the use of 
radiolabels like 32 P-labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including 
moderate stringency and high stringency, are provided in Sambrook et ah, supra . 

Sequences identified in such library screening methods can be compared and aligned to other 
known sequences deposited and available in public databases such as GenBank or other private sequence 
databases. Sequence identity (at either the amino acid or nucleotide level) within defined regions of the 
molecule or across the full-length sequence can be determined using methods known in the art and as 
described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or 
genomic libraries using the deduced amino acid sequence disclosed herein for the first time, and, if 
necessary, using conventional primer extension procedures as described in Sambrook et aL, supra , to detect 
precursors and processing intermediates of mRNA that may not have been re verse- transcribed into cDNA. 

2. Selection and Transformation of Host Cells 

Host cells are transfected or transformed with expression or cloning vectors described herein for 
PRO production and cultured in conventional nutrient media modified as appropriate for inducing 
promoters, selecting transformants, or amplifying the genes encoding the desired sequences. The culture 
conditions, such as media, temperature, pH and the like, can be selected by the skilled artisan without undue 
experimentation. In general, principles, protocols, and practical techniques for maximizing the productivity 
of cell cultures can be found in Mammalian Cell Biotechnology: a Practical Approach . M. Butler, ed. (IRL 
Press, 1991) and Sambrook et al., supra. 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the 
ordinarily skilled artisan, for example, CaCl 2 , CaP0 4 , liposome-mediated and electroporation. Depending 
on the host cell used, transformation is performed using standard techniques appropriate to such cells. The 
calcium treatment employing calcium chloride, as described in Sambrook et al., supra , or electroporation is 
generally used for prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of 
certain plant cells, as described by Shaw et al., Gene. 23:315 (1983) and WO 89/05859 published 29 June 
1989. For mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham 
and van der Eb, Virology, 52:456-457 (1978) can be employed. General aspects of mammalian cell host 
system transfections have been described in U.S. Patent No. 4,399,216. Transformations into yeast are 
typically carried out according to the method of Van Solingen et al., IBact, 130:946 (1977) and Hsiao et 
Pfoc Natl. Acad. Sci. (USA). 76:3829 (1979). However, other methods for introducing DNA into cells, 
such as by nuclear microinjection, electroporation, bacterial protoplast fusion with intact cells, or 
polycations, e.g., polybrene, polyornithine, may also be used. For various techniques for transforming 
mammalian cells, see Keown et al., Methods in Enzvmology , 185:527-537 (1990) and Mansour et al., 
Nature . 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, 
yeast, or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as 
Gram-negative or Gram-positive organisms, for example, Enterobacteriaceae such as £. coll Various E. coii 
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strains are publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 
31,537); & coli strain W31 10 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host 
cells include Enterobacteriaceae such as Escherichia, e.g., E. co/i, Enterobacter, Erwinia, Klebsiella, 
Proteus, Salmonella, e.g., Salmonella typhimurium, Serratia, e.g., Serratia marcescans, and Shigella, as well 
5 as Bacilli such as B. subtilis and B. licheniformis (e.g., B. lichenifonnis 41 P disclosed in DD 266,710 
published 12 April 1989), Pseudomonas such as P. aeruginosa, and Streptomyces. These examples are 
illustrative rather than limiting. Strain W3 1 10 is one particularly preferred host or parent host because it is a 
common host strain for recombinant DNA product fermentations. Preferably, the host cell secretes minimal 
amounts of proteolytic enzymes. For example, strain W3 1 10 may be modified to effect a genetic mutation 

10 in the genes encoding proteins endogenous to the host, with examples of such hosts including E. coli W31 10 
strain 1A2, which has the complete genotype tonA ; E. coli W31 10 strain 9E4, which has the complete 
genotype tonA ptr3; E. coli W31 10 strain 27C7 (ATCC 55,244), which has the complete genotype ton A ptr3 
phoA El 5 (argF-lac)169 degP ompT kan r \ E. coli W3110 strain 37D6, which has the complete genotype 
tonA ptr3 plwA E15 (argF-lac)169 degP ompT rbs7 ilvG kan r ; E. coli W3110 strain 40B4, which is strain 

15 37D6 with a non-kanamycin resistant degP deletion mutation; and an E. coli strain having mutant 
periplasmic protease disclosed in U.S. Patent No. 4,946,783 issued 7 August 1990. Alternatively, in vitro 
methods of cloning, e.g., PCR or other nucleic acid polymerase reactions, are suitable. 

In addition to prokaryotes, eukaryottc microbes such as filamentous fungi or yeast are suitable 
cloning or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower 

20 eukaryotic host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature , 
290: 140 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; 
Fleer et al., Bio/Technology , 9:968-975 (1991)) such as, e.g., K. lactis (MW98-8C, CBS683, CBS4574; 
Louvencourt et al., J. BacterioL 154(2):737-742 [1983]), K, fragilis (ATCC 12,424), K. bulgaricus (ATCC 
16,045), K, mckeramii (ATCC 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van 

25 den Berg et al., Bio/Technology , 8:135 (1990)), K. thermotolerans, and K. marxianus; yarrowia (EP 
402,226); Pichia pastoris (EP 183,070; Sreekrishna et al., J.Basic Microbiol., 28:265-278 [1988]); Candida; 
Trichoderma reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA , 76:5259- 
5263 [1979]); Schwanniomyces such as Schwann iomyces occidentalis (EP 394,538 published 31 October 
1990); and filamentous fungi such as, e.g., Neurospora, PenicUlium, Tolypocladium (WO 91/00357 

30 published 10 January 1991), and Aspergillus hosts such as A, nidulans (Ballance et al., Biochem. Biophys. 
Res. Commun., 112:284-289 [1983]; Tilburn et al., Gene, 26:205-221 [1983]; Yelton et al., Proc. Natl. 
Acad. Sci. USA. 81: 1470-1474 [1984]) and A. niger (Kelly and Hynes, EMBO J., 4:475-479 [1985]). 
Methylotropic yeasts are suitable herein and include, but are not limited to, yeast capable of growth on 
methanol selected from the genera consisting of Hansemda, Candida, Kloeckera, Pichia, Saccharomyces, 

35 Torulopsis, and Rhodotorula. A list of specific species that are exemplary of this class of yeasts may be 
found in C. Anthony, The Biochemistry of Methvlotrophs . 269 (1982). 

Suitable host cells for the expression of glycosylated PRO are derived from multicellular 
organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as 
well as plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and 

40 COS cells. More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC 
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CRL 1651); human embryonic kidney line (293 or 293 ceils subcloned for growth in suspension culture, 
Graham et ah, J. Gen ViroL , 36:59 (1977)); Chinese hamster ovary cellsADHFR (CHO, Urlaub and Chasin, 
Proc. Natl Acad. Sci. USA . 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod. . 23:243-251 
(1980)); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse 
mammary tumor (MMT 060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be 
within the skill in the art. 

3. Selection and Use of a Replicable Vector 

The nucleic acid (e.g., cDNA or genomic DNA) encoding PRO may be inserted into a replicable 
vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The 
vector may, for example, be in the form of a plasmid, cosmid, viral particle, or phage. The appropriate 
nucleic acid sequence may be inserted into the vector by a variety of procedures. In general, DNA is 
inserted into an appropriate restriction endonuclease site(s) using techniques known in the art. Vector 
components generally include, but are riot limited to, one or more of a signal sequence, an origin of 
replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination 
sequence. Construction of suitable vectors containing one or more of these components employs standard 
ligation techniques which are known to the skilled artisan. 

The PRO may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage 
site at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a 
component of the vector, or it may be a part of the PRO-encoding DNA that is inserted into the vector. The 
signal sequence may be a prokaryotic signal sequence selected, for example, from the group of the alkaline 
phosphatase, penicillinase, lpp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence 
may be, e.g., the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces ce- 
faclor leaders, the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C. albicans 
glucoamylase leader (EP 362,179 published 4 April 1990), or the signal described in WO 90/13646 
published 15 November 1990. In mammalian cell expression, mammalian signal sequences may be used to 
direct secretion of the protein, such as signal sequences from secreted polypeptides of the same or related 
species, as well as viral secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to 
replicate in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, 
and viruses. The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, 
the 2u. plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or 
BPV) are useful for cloning vectors in mammalian ceils. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable 
marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., 
ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply 
critical nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those that enable the 
identification of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine 
kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR 
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activity, prepared and propagated as described by Urlaub et al., Proc. Natl. Acad. Sci. USA , 77:4216 (1980). 
A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 [Stinchcomb et 
al., Nature, 282:39 (1979); Kingsman et al, Gene , 7:141 (1979); Tschemper et al., Gene , 10:157 (1980)]. 
The trpl gene provides a selection marker for a mutant strain of yeast lacking the ability to grow in 
5 tryptophan, for example, ATCC No. 44076 or PEP4- 1 [Jones, Genetics , 85: 12 (1 977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding 
nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells 
are well known. Promoters suitable for use with prokaryotic hosts include the p-lactamase and lactose 
promoter systems [Chang et al., Nature , 275:615 (1978); Goeddel et al, Nature , 281:544 (1979)], alkaline 

10 phosphatase, a tryptophan (trp) promoter system [Goeddel, Nucleic Acids Res. , 8:4057 (1980); EP 36,776], 
and hybrid promoters such as the tac promoter [deBoer et al., Proc. Natl. Acad. Sci. USA , 80:21-25 (1983)]. 
Promoters for use in bacterial systems also will contain a Shine-Dalgarno (S.D.) sequence operably linked 
to the DNA encoding i?RO. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 

15 phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem. , 255:2073 (1980)] or other glycolytic enzymes 
[Hess et al., J. Adv. Enzyme Reg., 7:149 (1968); Holland, Biochemistry , 17:4900 (1978)], such as enolase, 
glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, 
phosphoglucose isomerase, and glucokinase. 

20 Other yeast promoters, which are inducible promoters having the additional advantage of 

transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, 
metallothionein, glyceraldehyde-3 -phosphate dehydrogenase, and enzymes responsible for maltose and 
galactose utilization. Suitable vectors and promoters for use in yeast expression are further described in EP 

25 73,657. 

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters 
obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 July 
1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a 
retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the 

30 actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters 
are compatible with the host cell systems. 

Transcription of a DNA encoding the PRO by higher eukaryotes may be increased by inserting an 
enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 
bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from 

35 mammalian genes (globin, elastase, albumin, ot-fetoprotein, and insulin). Typically, however, one will use 
an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the 
replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the 
late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at 
a position 5 1 or 3* to the PRO coding sequence, but is preferably located at a site 5' from the promoter. 
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Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or 
nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination 
of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, 
occasionally 3', untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain 
5 nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA 
encoding PRO. 

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PRO in 
recombinant vertebrate cell culture are described in Gething et al., Nature, 293:620-625 (1981); Mantei et 
al., Nature , 281:40-46 (1979); EP 117,060; and EP 1 17,058. 
10 4. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. 
Natl Acad. Sci. USA . 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be 
15 employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA 
hybrid duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be 
carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the 
presence of antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
20 immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to 
quantitate directly the expression of gene product. Antibodies useful for immunohistochemical staining 
and/or assay of sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. 
Conveniently, the antibodies may be prepared against a native sequence PRO polypeptide or against a 
synthetic peptide based on the DNA sequences provided herein or against exogenous sequence fused to PRO 
25 DNA and encoding a specific antibody epitope. 

5. Purification of Polypeptide 
Forms of PRO may be recovered from culture medium or from host cell iysates. If membrane- 
bound, it can be released from the membrane using a suitable detergent solution (e.g. Triton-X 100) or by 
enzymatic cleavage. Cells employed in expression of PRO can be disrupted by various physical or chemical 
30 means, such as freeze-thaw cycling, sonication, mechanical disruption, or ceil lysing agents. 

It may be desired to purify PRO from recombinant ceil proteins or polypeptides. The following 
procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; 
ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as 
DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, 
35 Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal chelating 
columns to bind epi tope-tagged forms of the PRO. Various methods of protein purification may be 
employed and such methods are known in the art and described for example in Deutscher, Methods in 
Enzvmologv , 182 (1990); Scopes, Protein Purification: Principles and Practice , Springer- Verlag, New York 
(1982). The purification step(s) selected will depend, for example, on the nature of the production process 
40 used and the particular PRO produced. 
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E. Tissue Distribution 

The location of tissues expressing the PRO can be identified by determining mRNA expression in 
various human tissues. The location of such genes provides information about which tissues are most likely 
to be affected by the stimulating and inhibiting activities of the PRO polypeptides. The location of a gene in 
a specific tissue also provides sample tissue for the activity blocking assays discussed below. 

As noted before, gene expression in various tissues may be measured by conventional Southern 
blotting, Northern blotting to quantitate the transcription of mRNA (Thomas, Proc. Natl Acad. ScL USA, 
77:5201-5205 [1980]), dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled 
probe, based on the sequences provided herein. Alternatively, antibodies may be employed that can 
recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or 
DNA-protein duplexes. 

Gene expression in various tissues, alternatively, may be measured by immunological methods, 
such as immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to 
quantitate directly the expression of gene product. Antibodies useful for immunohistochemical staining 
and/or assay of sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. 
Conveniently, the antibodies may be prepared against a native sequence of a PRO polypeptide or against a 
synthetic peptide based on the DNA sequences encoding the PRO polypeptide or against an exogenous 
sequence fused to a DNA encoding a PRO polypeptide and encoding a specific antibody epitope. General 
techniques for generating antibodies, and special protocols for Northern blotting and in situ hybridization are 
provided below. 

F. Antibody Binding Studies 

The activity of the PRO polypeptides can be further verified by antibody binding studies, in which 
the ability of anti-PRO antibodies to inhibit the effect of the PRO polypeptides, respectively, on tissue cells 
is tested. Exemplary antibodies include polyclonal, monoclonal, humanized, bispecific, and heteroconjugate 
antibodies, the preparation of which will be described hereinbelow. 

Antibody binding studies may be carried out in any known assay method, such as competitive 
binding assays, direct and indirect sandwich assays, and immunoprecipitation assays. Zola, Monoclonal 
Antibodies: A Manual of Techniques, pp.147-158 (CRC Press, Inc., 1987). 

Competitive binding assays rely on the ability of a labeled standard to compete with the test sample 
analyte for binding with a limited amount of antibody. The amount of target protein in the test sample is 
inversely proportional to the amount of standard that becomes bound to the antibodies. To facilitate 
determining the amount of standard that becomes bound, the antibodies preferably are insolubilized before 
or after the competition, so that the standard and analyte that are bound to the antibodies may conveniently 
be separated from the standard and analyte which remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different 
immunogenic portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample analyte 
is bound by a first antibody which is immobilized on a solid support, and thereafter a second antibody binds 
to the analyte, thus forming an insoluble three-part complex. See, e.g., US Pat No. 4,376,1 10. The second 
antibody may itself be labeled with a detectable moiety (direct sandwich assays) or may be measured using 
an anti-immunoglobulin antibody that is labeled with a detectable moiety (indirect sandwich assay). For 

56 



WO 2004/081199 



PCT/US2004/007862 



example, one type of sandwich assay is an ELISA assay, in which case the detectable moiety is an enzyme. 

For immunohistochemistry, the tissue sample may be fresh or frozen or may be embedded in 
paraffin and fixed with a preservative such as formalin, for example. 

G. Cell-Based Assays 

Cell-based assays and animal models for immune related diseases such as psoriasis can be used to 
further understand the relationship between the genes and polypeptides identified herein and the 
development and pathogenesis psoriasis. 

In a different approach, cells of a cell type known to be involved in immune related disease are 
transfected with the cDNAs described herein, and the ability of these cDNAs to stimulate or inhibit immune 
related disease is analyzed. Suitable cells can be transfected with the desired gene, and monitored for such 
functional activity. Such transfected cell lines can then be used to test the ability of poly- or monoclonal 
antibodies or antibody compositions to inhibit or stimulate psoraisis. Cells transfected with the coding 
sequences of the, genes identified herein can further be used to identify drug candidates for the treatment of 
immune related disease. 

In addition, primary cultures derived from transgenic animals (as described below) can be used in 
the cell-based assays herein, although stable cell lines are preferred. Techniques to derive continuous cell 
lines from transgenic animals are well known in the art (see, e.g., Small et al, Mol. Cell. Biol. 5: 642-648 
[1985]). 

H. Animal Models 

The results of cell based in vitro assays can be further verified using in vivo animal models and 
assays for immune related disease. A variety of well known animal models can be used to further 
understand the role of the genes identified herein in the development and pathogenesis of immune related 
disease, and to test the efficacy of candidate therapeutic agents, including antibodies, and other antagonists 
of the native polypeptides, including small molecule antagonists. The in vivo nature of such models makes 
them predictive of responses in human patients. Animal models of immune related diseases include both 
non-recombinant and recombinant (transgenic) animals. Non-recombinant animal models include, for 
example, rodent, e.g., murine models. Such models can be generated by introducing cells into syngeneic 
mice using standard techniques, e.g., subcutaneous injection, tail vein injection, spleen implantation, 
intraperitoneal implantation, implantation under the renal capsule, etc. 

Graft-versus-host disease occurs when immunocompetent cells are transplanted into 
immunosuppressed or tolerant patients. The donor cells recognize and respond to host antigens. The 
response can vary from life threatening severe inflammation to mild cases of diarrhea and weight loss. 
Graft-versus-host disease models provide a means of assessing T cell reactivity against MHC antigens and 
minor transplant antigens. A suitable procedure is described in detail in Current Protocols in Immunology, 
above, unit 4.3. 

An animal model for skin allograft rejection is a means of testing the ability of T cells to mediate in 
vivo tissue destruction and a measure of their role in transplant rejection. The most common and accepted 
models use murine tail-skin grafts. Repeated experiments have shown that skin allograft rejection is 
mediated by T cells, helper T cells and killer-effector T cells, and not antibodies. Auchincloss, H. Jr. and 
Sachs, D. H., Fundamental Immunology, 2nd ed., W. E. Paul ed., Raven Press, NY, 1989, 889-992. A 

57 



WO 2004/081199 



PCT/US2004/007862 



suitable procedure is described in detail in Current Protocols in Immunology, above, unit 4.4. Other 
transplant rejection models which can be used to test the compounds of the invention are the allogeneic heart 
transplant models described by Tanabe, M. et al, Transplantation (1994) 58:23 and Tinubu, S. A. et al, J. 
Immunol (1994) 4330-4338. 

5 Contact hypersensitivity is a simple delayed type hypersensitivity in vivo assay of cell mediated 

immune function. In this procedure, cutaneous exposure to exogenous haptens which gives rise to a delayed 
type hypersensitivity reaction which is measured and quantitated. Contact sensitivity involves an initial 
sensitizing phase followed by an elicitation phase. The elicitation phase occurs when the T lymphocytes 
encounter an antigen to which they have had previous contact. Swelling and inflammation occur, making 

10 this an excellent model of human allergic contact dermatitis. A suitable procedure is described in detail in 
Current Protocols in Immunology, Eds. J. E. Cologan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach 
and W. Strober, John Wiley & Sons, Inc., 1994, unit 4.2. See also Grabbe, S. and Schwarz, T, Immun. 
Today 19 (1): 37-44 (1998) . \ 

Additionally, the compounds of the invention can be tested on animal models for immune related 

15 disease like psoriasis. Evidence suggests a T cell pathogenesis for psoriasis, therefore the compounds of the 
invention can be tested in the scid/scid mouse model described by Schon, M. P. et al, Nat Med. (1997) 
3:183, in which the mice demonstrate histopathologic skin lesions resembling psoriasis. Another suitable 
model is the human skin/scid mouse chimera prepared as described by Nickoloff, B. J. et al. Am. J. Patk 
(1995) 146:580. 

20 Recombinant (transgenic) animal models can be engineered by introducing the coding portion of 

the genes identified herein into the genome of animals of interest, using standard techniques for producing 
transgenic animals. Animals that can serve as a target for transgenic manipulation include, without 
limitation, mice, rats, rabbits, guinea pigs, sheep, goats, pigs, and non-human primates, e.g., baboons, 
chimpanzees and monkeys. Techniques known in the art to introduce a transgene into such animals include 

25 pronucleic microinjection (Hoppe and Wanger, U.S. Patent No. 4,873,191); retrovirus-mediated gene 
transfer into germ lines (e.g., Van der Putten et al, Proc. Natl. Acad. Sci. USA §2, 6148-615 [1985]); gene 
targeting in embryonic stem cells (Thompson et al, Cell 56, 313-321 [1989]); electroporation of embryos 
(Lo, Mol Cel. Biol, 3, 1803-1814 [1983]); sperm-mediated gene transfer (Lavitrano et al, Cell 57, 717-73 
[1989]). For review, see, for example, U.S. Patent No. 4,736,866. 

30 For the purpose of the present invention, transgenic animals include those that carry the transgene 

only in part of their cells ("mosaic animals"). The transgene can be integrated either as a single transgene, or 
in concatamers, e.g., head-to-head or head-to-tail tandems. Selective introduction of a transgene into a 
particular cell type is also possible by following, for example, the technique of Lasko et al, Proc. Natl. 
Acad. Sci. USA 89, 6232-636 (1992). 

35 The expression of the transgene in transgenic animals can be monitored by standard techniques. 

For example, Southern blot analysis or PCR amplification can be used to verify the integration of the 
transgene. The level of mRNA expression can then be analyzed using techniques such as in situ 
hybridization, Northern blot analysis, PCR, or immunocytochemistry. 

The animals may be further examined for signs of immune disease pathology, for example by 

40 histological examination to determine infiltration of immune cells into specific tissues. Blocking 
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experiments can also be performed in which the transgenic animals are treated with the compounds of the 
invention to determine the extent of the T cell proliferation stimulation or inhibition of the compounds. In 
these experiments, blocking antibodies which bind to the PRO polypeptide, prepared as described above, are 
administered to the animal and the effect on immune function is determined. 

Alternatively, "knock out" animals can be constructed which have a defective or altered gene 
encoding a polypeptide identified herein, as a result of homologous recombination between the endogenous 
gene encoding the polypeptide and altered genomic DNA encoding the same polypeptide introduced into an 
embryonic cell of the animal. For example, cDNA encoding a particular polypeptide can be used to clone 
genomic DNA encoding that polypeptide in accordance with established techniques. A portion of the 
genomic DNA encoding a particular polypeptide can be deleted or replaced with another gene, such as a 
gene encoding a selectable marker which can be used to monitor integration. Typically, several kilobases of 
unaltered flanking DNA (both at the 5' and 3' ends) are included in the vector [see e.g., Thomas and 
Capecchi, Cell, 51:503 (1987) for a description of homologous recombination vectors]. The vector is 
introduced into an embryonic stem cell line {e.g., by electroporation) and cells in which the introduced DNA 
has homologously recombined with the endogenous DNA are selected [see e.g., Li et al., Cell, 69:915 
(1992)]. The selected cells are then injected into a blastocyst of an animal (e.g., a mouse or rat) to form 
aggregation chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152], A chimeric embryo can then be 
implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create a 
"knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be 
identified by standard techniques and used to breed animals in which all cells of the animal contain the 
homologously recombined DNA. Knockout animals can be characterized for instance, for their ability to 
defend against certain pathological conditions and for their development of pathological conditions due to 
absence of the polypeptide. 

I- ImmunoAdiuvant Therapy 

In one embodiment, the immunostimulating compounds of die invention can be used in 
immunoadjuvant therapy for the treatment of tumors (cancer). It is now well established that T cells 
recognize human tumor specific antigens. One group of tumor antigens, encoded by the MAGE, BAGE and 
GAGE families of genes, are silent in all adult normal tissues , but are expressed in significant amounts in 
tumors, such as melanomas, lung tumors, head and neck tumors, and bladder carcinomas. DeSmet, C. et al., 

(1996) Proc. Natl. Acad. Sci. USA, 93:7149. It has been shown that costimulation of T cells induces tumor 
regression and an antitumor response both in vitro and in vivo. Melero, I. et al., Nature Medicine (1997) 
3:682; Kwon, E. D. et al, Proc. Natl. Acad. Sci. USA (1997) 94: 8099; Lynch, D. H. et al, Nature Medicine 

(1997) 3:625; Finn, O. J. and Lotze, M. T., J. Immunol. (1998) 21:1 14. The stimulatory compounds of the 
invention can be administered as adjuvants, alone or together with a growth regulating agent, cytotoxic agent 
or chemotherapeutic agent, to stimulate T cell proliferation/activation and an antitumor response to tumor 
antigens. The growth regulating, cytotoxic, or chemotherapeutic agent may be administered in conventional 
amounts using known administration regimes. Immunostimulating activity by the compounds of the 
invention allows reduced amounts of the growth regulating, cytotoxic, or chemotherapeutic agents thereby 
potentially lowering the toxicity to the patient. 
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J. Screening Assays for Drug Candidates 

Screening assays for drug candidates are designed to identify compounds that bind to or complex 
with the polypeptides encoded by the genes identified herein or a biologically active fragment thereof, or 
otherwise interfere with the interaction of the encoded polypeptides with other cellular proteins. Such 
5 screening assays will include assays amenable to high-throughput screening of chemical libraries, making 
them particularly suitable for identifying small molecule drug candidates. Small molecules contemplated 
include synthetic organic or inorganic compounds, including peptides, preferably soluble peptides, 
(poly)peptide-immunoglobulin fusions, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and 

10 chimeric or humanized versions of such antibodies or fragments, as well as human antibodies and antibody 
fragments. The assays can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays and cell based assays, which are well characterized in the art. 
All assays are common in that they call for contacting the drug candidate with a polypeptide encoded by a 
nucleic acid identified herein under conditions and for a time sufficient to allow these two components to 

15 interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in 
the reaction mixture. In a particular embodiment, the polypeptide encoded by the gene identified herein or 
the drug candidate is immobilized on a solid phase, e.g., on a microliter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution 

20 of the polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific 
for the polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed by 
adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized 
component, e.g., the coated surface containing the anchored component When the reaction is complete, the 
non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are 

25 detected. When the originally non-immobilized component carries a detectable label, the detection of label 
immobilized on the surface indicates that complexing occurred. Where the originally non-immobilized 
component does not carry a label, complexing can be detected, for example, by using a labelled antibody 
specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular protein encoded by a gene 

30 identified herein, its interaction with that protein can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, cross-linking, co- 
immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, 
protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields and 
co-workers [Fields and Song, Nature (London) 340, 245-246 (1989); Chien et ai, Proc. Natl. Acad. Sci 

35 USA 88, 9578-9582 (1991)] as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA 89, 5789-5793 
(1991). Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular 
domains, one acting as the DNA-binding domain, while the other one functioning as the transcription 
activation domain. The yeast expression system described in the foregoing publications (generally referred 
to as the "two-hybrid system") takes advantage of this property, and employs two hybrid proteins, one in 

40 which the target protein is fused to the DNA-binding domain of GAL4, and another, in which candidate 
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activating proteins are fused to the activation domain. The expression of a GALUacZ reporter gene under 
control of a GAL4-activated promoter depends on reconstitution of GAL4 activity via protein-protein 
interaction. Colonies containing interacting polypeptides are detected with a chromogenic substrate for 0- 
galactosidase. A complete kit (MATCHMAKER™) for identifying protein-protein interactions between two 
specific proteins using the two-hybrid technique is commercially available from Clontech. This system can 
also be extended to map protein domains involved in specific protein interactions as well as to pinpoint 
amino acid residues that are crucial for these interactions. 

In order to find compounds that interfere with the interaction of a gene identified herein and other 
intra- or extracellular components can be tested, a reaction mixture is usually prepared containing the 
product of the gene and the intra- or extracellular component under conditions and for a time allowing for 
the interaction and binding of the two products. To test the ability of a test compound to inhibit binding, the 
reaction is run in the absence and in the presence of the test compound. In addition, a placebo may be added 
to a third reaction mixture, to serve as positive control. The binding (complex formation) between the test 
compound and the intra- or extracellular component present in the mixture is monitored as described above. 
The formation of a complex in the control reaction(s) but not in the reaction mixture containing the test 
compound indicates that the test compound interferes with the interaction of the test compound and its 
reaction partner. 

K - Compositions and Meth ods for Treatment 

The compositions useful in the treatment of immune related disease include, without limitation, 
proteins, antibodies, small organic molecules, peptides, phosphopeptides, antisense and ribozyme molecules, 
triple helix molecules, etc. that inhibit immune function, for example, T cell proliferation/activation, 
lymphokine release, or immune cell infiltration. 

For example, antisense RNA and RNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. When antisense DNA is used, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g., between about -10 and +10 
positions of the target gene nucleotide sequence, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozyrnes act by sequence-specific hybridization to the complementary target RNA, followed by 
endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified 
by known techniques. For further details see, e.g., Rossi, Current Biology 4, 469-471 (1994), and PCT 
publication No. WO 97/33551 (published September 18, 1997). 

Nucleic acid molecules in triple helix formation used to inhibit transcription should be single- 
stranded and composed of deoxynucleotides. The base composition of these oligonucleotides is designed 
such that it promotes triple helix formation via Hoogsteen base pairing rules, which generally require 
sizeable stretches of purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT 
publication No. WO 97/33551, supra. 

These molecules can be identified by any or any combination of the screening assays discussed 
above and/or by any other screening techniques well known for those skilled in the art. 
L. Anti-PRO Antibodies 
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The present invention further provides anti-PRO antibodies. Exemplary antibodies include 
polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies. 

L Polyclonal Antibodies 
The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, 
by one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing 
agent and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. 
The immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to 
conjugate the immunizing agent to a protein known to be immunogenic in the mammal being immunized. 
Examples of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be 
employed include Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). The immunization protocol may be selected by one skilled in the art without 

undue experimentation. 

2. Monoclonal Antibodies 

The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may 
be prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature, 256:495 
(1975). In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized 
with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will 
specifically bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes CPBLs") are used if cells of human origin are desired, or 
spleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes 
are then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to 
form a hybridoma cell [Goding, Monoclonal Antibodies: Pri nciples and Practice, Academic Press, (1986) 
pp. 59-103]. Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of 
rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma 
cells may be cultured in a suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental cells lack the 
enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the 
hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression 
of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. 
More preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from 
the Salk Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, 
Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described 
for the production of human monoclonal antibodies [Kozbor, J. Immunol., 133:3001 (1984); Brodeur et al., 
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Monoclonal Antibody Production Techniques and Applications , Marcel Dekker, Inc., New York, (1987) pp. 
51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence 
of monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies 
5 produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such 
as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELTSA). Such techniques and 
assays are known in the art. The binding affinity of the monoclonal antibody can, for example, be 
determined by the Scatchard analysis of Munson and Pollard, Anal. Biochem. , 107:220 0980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 
10 procedures and grown by standard methods [Goding, supral. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMM640 medium. Alternatively, the 
hybridoma cells may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture 
medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, 
15 protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity 
chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those 
described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be 
readily isolated and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are 

20 capable of binding specifically to genes encoding the heavy and light chains of murine antibodies). The 
hybridoma cells of the invention serve as a preferred source of such DNA. Once isolated, the DNA may be 
placed into expression vectors, which are then transfected into host cells such as simian COS cells, Chinese 
hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to 
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also may be 

25 modified, for example, by substituting the coding sequence for human heavy and light chain constant 
domains in place of the homologous murine sequences [U.S. Patent No. 4,816,567; Morrison et al., supral or 
by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a non- 
immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant 
domains of an antibody of the invention, or can be substituted for the variable domains of one antigen- 

30 combining site of an antibody of the invention to create a chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are 
well known in the art. For example, one method involves recombinant expression of immunoglobulin light 
chain and modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as 
to prevent heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with 

35 another amino acid residue or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques 
known in the art. 

3. Human and Humanized Antibodies 
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The anti-PRO antibodies of the invention may further comprise humanized antibodies or human 
antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab , )2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 

5 Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 
capacity. In some instances, Fv framework residues of the human immunoglobulin are replaced by 
corresponding non-human residues. Humanized antibodies may also comprise residues which are found 

10 neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in 
which all or substantially all of the CDR regions correspond to those of a non-human immunoglobulin and 
all or substantially all o^ the FR regions are those of a human immunoglobulin consensus sequence. The 
humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region 

15 (Fc), typically that of a human immunoglobulin [Jones et al., Nature . 321 :522-525 (1986); Riechmann et al., 
Nature , 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. , 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These 
non-human amino acid residues are often referred to as "import" residues, which are typically taken from an 

20 "import" variable domain. Humanization can be essentially performed following the method of Winter and 
co-workers [Jones et al., Nature , 321:522-525 (1986); Riechmann et al., Nature , 332:323-327 (1988); 
Verhoeyen et al., Science , 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric 
antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact human variable domain has 

25 been substituted by the corresponding sequence from a non-human species. In practice, humanized 
antibodies are typically human antibodies in which some CDR residues and possibly some FR residues are 
substituted by residues from analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art, including phage 
display libraries [Hoogenboom and Winter, J. Mol. Biol. , 227:381 (1991); Marks et al., J. Mol. Biol, 

30 222:581 (1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation of 
human monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy . Alan R. Liss, p. 77 
(1985) and Boerner et al., J. Immunol. , 147(l) :86-95 (1991)]. Similarly, human antibodies can be made by 
introducing of human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous 
immunoglobulin genes have been partially or completely inactivated. Upon challenge, human antibody 

35 production is observed, which closely resembles that seen in humans in all respects, including gene 
rearrangement, assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent 
Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific 
publications: Marks et al, Bio/Technology 10, 779-783 (1992); Lonberg et al, Nature 368 856-859 (1994); 
Morrison. Nature 368 , 812-13 (1994); Fishwild et aL Nature Biotechnology 14, 845-51 (1996); Neuberger, 

40 Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. Immunol. 13 65-93 (1995). 
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The antibodies may also be affinity matured using known selection and/or mutagenesis methods as 
described above. Preferred affinity matured antibodies have an affinity which is five times, more preferably 
10 times, even more preferably 20 or 30 times greater than the starting antibody (generally murine, 
humanized or human) from which the matured antibody is prepared. 

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities is for the 
PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 
subunit. 

Methods for making bispecific antibodies are knov/n in the art. Traditionally, the recombinant 
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light- 
chain pairs, where the two heavy chains have different specificities [MUstein and Cuelkv Nature, 305:537- 
539 (1983)]. Because of the random assortment of immunoglobulin heavy and light chains, these 
hybridomas (quadromas) produce a potential mixture of ten different antibody molecules, of which only one 
has the correct bispecific structure. The purification of the correct molecule is usually accomplished by 
affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 1993, 
and in Traunecker et al., EMBO J. . 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) 
can be fused to immunoglobulin constant domain sequences. The fusion preferably is with an 
immunoglobulin heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. 
It is preferred to have the first heavy-chain constant region (CHI) containing the site necessary for light- 
chain binding present in at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain 
fusions and, if desired, the immunoglobulin light chain, are inserted into separate expression vectors, and are 
co-transfected into a suitable host organism. For further details of generating bispecific antibodies see, for 
example, Suresh et al., Methods in Enzvmolog v. 121:210 (1986). 

According to another approach described in WO 96/27011, the interface between a pair of antibody 
molecules can be engineered to maximize the percentage of heterodimers which are recovered from 
recombinant cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody 
constant domain. In this method, one or more small amino acid side chains from the interface of the first 
antibody molecule are replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory 
"cavities* 1 of identical or similar size to the large side chain(s) are created on the interface of the second 
antibody molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or threonine). 
This provides a mechanism for increasing the yield of the heterodimer over other unwanted end-products 
such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab*) 2 
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 
described in the literature. For example, bispecific antibodies can be prepared can be prepared using 
chemical linkage. Brennan et al, Science 229:81 (1985) describe a procedure wherein intact antibodies are 
proteolytically cleaved to generate F(ab')i fragments. These fragments are reduced in the presence of the 
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dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide 
formation. The Fab' fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of 
the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The 
5 bispecific antibodies produced can be used as agents for the selective immobilization of enzymes. 

Fab' fragments may be directly recovered from E. coli and chemically coupled to form bispecific 
antibodies. Shalaby et al, J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized 
bispecific antibody F(ab') 2 molecule. Each Fab' fragment was separately secreted from E. coli and 
subjected to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody 

10 thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well 
as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various technique for making and isolating bispecific antibody fragments directly from 
recombinant cell culture have also been described. For example, bispecific antibodies have been produced 
using leucine zippers. Kostelny et al, J. Immunol. 148(5): 1547-1553 (1992). The leucine zipper peptides 

15 from the Fos and Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. 
The antibody homodimers were reduced at the hinge region to form monomers and then re-oxidized to form 
the antibody heterodimers. This method can also be utilized for the production of antibody homodimers. 
The "diabody" technology described by Hollinger et al, Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) 
has provided an alternative mechanism for making bispecific antibody fragments. The fragments comprise a 

20 heavy-chain variable domain (V H ) connected to a light-chain variable domain (V L ) by a linker which is too 
short to allow pairing between the two domains on the same chain. Accordingly, the V H and V L domains of 
one fragment are forced to pair with the complementary V L and V H domains of another fragment, thereby 
forming two antigen-binding sites. Another strategy for making bispecific antibody fragments by the use of 
single-chain Fv (sFv) dimers has also been reported. See, Gruber et al, J. Immunol. 152:5368 (1994). 

25 Antibodies with more than two valencies are contemplated. For example, trispeciflc antibodies can be 
prepared. Tuttgf al. J. Immunol. 147:60(1991). 

Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypeptide 
herein. Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a 
triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc 

30 receptors for IgG (FcyR), such as FcyRI (CD64), FcyRH (CD32) and FcvRHI (CD16) so as to focus cellular 
defense mechanisms to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also 
be used to localize cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies 
possess a PRO-binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as 
EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of interest binds the PRO polypeptide and 

35 further binds tissue factor (TF). 

5. Heteroconjugate Antibodies 
Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been 
proposed to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of 

40 HIV infection [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be 
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prepared in vitro using known methods in synthetic protein chemistry, including those involving 
crosslinking agents. For example, immunotoxins may be constructed using a disulfide exchange reaction or 
by forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate and 
methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 

6. Effector Function Engineering 

It may be desirable to modify the antibody of the invention with respect to effector function, so as 
to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may 
be introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The 
homodimeric antibody thus generated may have improved internalization capability and/or increased 
complement- mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al. , J. 
Exp Med ., 176: 1191-1195 (1992) and Shopes, J. Immunol ., 148 : 2918-2922 (1992). Homodimeric 
antibodies with enhanced anti- tumor activity may also be prepared using hetero Afunctional cross-linkers as 
described in Wolff et al Cancer Research . 53: 2560-2565 (1993). Alternatively, an antibody can be 
engineered that has dual Fc regions and may thereby have enhanced complement lysis and ADCC 
capabilities. See Stevenson et al, Anti-Cancer Drug Design. 3: 219-230 (1989). 

7. Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic 
agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, fungal, plant, 
or animal origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described 
above. Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, 
nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A 
chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca 
americana proteins (PAPI, PAPIT, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A 
variety of radionuclides are available for the production of radioconjugated antibodies. Examples include 
2I2 Bi, 13, I, 13, In, 90 Y, and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein- 
coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), 
bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as 
disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p- 
azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)- 
ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such 
as l,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in 
Vitetta et al, Science , 238: 1098 (1987). Carbon- 14-labeled l-isothiocyanatobenzyl-3-methyldiethylene 
triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of radio nucleotide to 
the antibody. See W094/1 1026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for 
utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, 
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followed by removal of unbound conjugate from the circulation using a clearing agent and then 
administration of a "ligand" (e.g., avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleotide). 

8. Immunoliposomes 
The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes 
5 containing the antibody are prepared by methods known in the art, such as described in Epstein et al., Proc, 
Natl. Acad. Sci. USA , 82: 3688 (1985); Hwang et al., Proc. Natl Acad. Sci. USA , 77: 4030 (1980); and U.S. 
Pat Nos. 4,485,045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent 
No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method 
10 with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG-derivatized 
phosphatidylethanolamine (PEG-PE). Liposomes are extruded through filters of defined pore size to yield 
liposomes with the desired diameter. Fab' fragments of the antibody of the present invention can be 
conjugated to the liposomes as described in Martin et al J. Biol. Chem., 257: 286-288 (1982) via a v 
disulfide-interchange reaction. A chemotherapeutic agent (such as Doxorubicin) is optionally contained 
15 within the liposome. See Gabizon et al, J. National Cancer Inst. , 81(19): 1484 (1989). 

M. Pharmaceutical Compositions 

The active PRO molecules of the invention (e.g., PRO polypeptides, anti-PRO antibodies, and/or 
variants of each) as well as other molecules identified by the screening assays disclosed above, can be 
administered for the treatment of immune related disease, in the form of pharmaceutical compositions. 

20 Therapeutic formulations of the active PRO molecule, preferably a polypeptide or antibody of the 

invention, are prepared for storage by mixing the active molecule having the desired degree of purity with 
optional pharmaceutically acceptable carriers, excipients or stabilizers (Remington's Pharmaceutical 
Sciences 16th edition, Osol, A. Ed. [1980]), in the form of lyophilized formulations or aqueous solutions. 
Acceptable carriers, excipients, or stabilizers are nontoxic to recipients at the dosages and concentrations 

25 employed, and include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
ascorbic acid and methionine; preservatives (such as octadecyldimethylbenzyl ammonium chloride; 
hexamethonium chloride; benzalkonium chloride, benzethontum chloride; phenol, butyl or benzyl alcohol; 
alkyl parabens such as methyl or propyl paraben; catechol; resorcinol; cyclohexanol; 3-pentanol; and m- 
cresol); low molecular weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, 

30 gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as 
glycine, giutamine, asparagine, histidine, arginine, or lysine; monosaccharides, disaccharides, and other 
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugars such as 
sucrose, mannitol, trehalose or sorbitol; salt-forming counter-ions such as sodium; metal complexes (e.g., 
Zn-protein complexes); and/or non-ionic surfactants such as TWEEN™, PLURONICS™ or polyethylene 

35 glycol (PEG). 

Compounds identified by the screening assays disclosed herein can be formulated in an analogous 
manner, using standard techniques well known in the art. 

Lipofections or liposomes can also be used to deliver the PRO molecule into cells. Where antibody 
fragments are used, the smallest inhibitory fragment which specifically binds to the binding domain of the 
40 target protein is preferred. For example, based upon the variable region sequences of an antibody, peptide 
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molecules can be designed which retain the ability to bind the target protein sequence. Such peptides can be 
synthesized chemically and/or produced by recombinant DNA technology (see, e.g., Marasco et al, Proc. 
Natl. Acad. Sci. USA90, 7889-7893 [1993]). 

The formulation herein may also contain more than one active compound as necessary for the 
particular indication being treated, preferably those with complementary activities that do not adversely 
affect each other. Alternatively, or in addition, the composition may comprise a cytotoxic agent, cytokine or 
growth inhibitory agent. Such molecules are suitably present in combination in amounts that are effective 
for the purpose intended. 

The active PRO molecules may also be entrapped in microcapsules prepared, for example, by 
coacervation techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin- 
microcapsules and poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems 
(for example, liposomes, albumin microspheres, microemulsions, na no-particles and nanocapsules) or in 
macroemulsions. Such techniques are disclosed in Remington's Phamiaceutical Sciences 16th edition, Osol, 
A. Ed. (1980). 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished 
by filtration through sterile filtration membranes. 

Sustained-release preparations of the PRO molecules may be prepared. Suitable examples of 
sustained-release preparations include semipermeable matrices of solid hydrophobic polymers containing the 
antibody, which matrices are in the form of shaped articles, e.g., films, or microcapsules. Examples of 
sustained-release matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), 
or poly(vinylalcohol)), polylactides (U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y-ethyl-L- 
glutamate, non-degradable ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as 
the LUPRON DEPOT™ (injectable microspheres composed of lactic acid-glycolic acid copolymer and 
leuprolide acetate), and poly-D-(-)-3-hydroxy butyric acid. While polymers such as ethylene -vinyl acetate 
and lactic acid-glycolic acid enable release of molecules for over 100 days, certain hydrogels release proteins 
for shorter time periods. When encapsulated antibodies remain in the body for a long time, they may 
denature or aggregate as a result of exposure to moisture at 37°C, resulting in a loss of biological activity 
and possible changes in immunogenicity. Rational strategies can be devised for stabilization depending on 
the mechanism involved. For example, if the aggregation mechanism is discovered to be intermolecuiar S-S 
bond formation through thio-disulfide interchange, stabilization may be achieved by modifying sulfhydryl 
residues, lyophilizing from acidic solutions, controlling moisture content, using appropriate additives, and 
developing specific polymer matrix compositions. 
N. Methods of Treatment 

It is contemplated that the polypeptides, antibodies and other active compounds of the present 
invention may be used to treat psoriasis, IBD and related conditions, such as T cell mediated diseases, 
including those characterized by infiltration of inflammatory cells into a tissue. 

Spondyloarthropathies are a group of disorders with some common clinical features and the 
common association with the expression of HLA-B27 gene product. The disorders include: ankylosing 
sponylitis, Reiter's syndrome (reactive arthritis), arthritis associated with inflammatory bowel disease, 
spondylitis associated with psoriasis, juvenile onset spondyloarthropathy and undifferentiated 
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spondyloarthropathy. Distinguishing features include sacroileitis with or without spondylitis; inflammatory 
asymmetric arthritis; association with HLA-B27 (a serologically defined allele of the HLA-B locus of class I 
MHC); ocular inflammation, and absence of autoantibodies associated with other rheumatoid disease. The 
cell most implicated as key to induction of the disease is the CD8+ T lymphocyte, a cell which targets 
5 antigen presented by class I MHC molecules. CD8+ T cells may react against the class I MHC allele HLA- 
B27 as if it were a foreign peptide expressed by MHC class I molecules. It has been hypothesized that an 
epitope of HLA-B27 may mimic a bacterial or other microbial antigenic epitope and thus induce a CD8+ T 
cells response. 

Systemic sclerosis (scleroderma) has an unknown etiology. A hallmark of the disease is induration 
10 of the skin; likely this is induced by an active inflammatory process. Scleroderma can be localized or 
systemic; vascular lesions are common and endothelial cell injury in the micro vasculature is an early and 
important event in the development of systemic sclerosis; the vascular injury may be immune mediated. An 
, immunologic basis is implied by the presence of mononuclear cell infiltrates in the cutaneous lesions and the 

presence of anti-nuclear antibodies in many patients. ICAM-1 is often upreguiated on the cell surface of 
15 fibroblasts in skin lesions suggesting that T cell interaction with these cells may have a role in the 
pathogenesis of the disease. Other organs involved include: the gastrointestinal tract: smooth muscle 
atrophy and fibrosis resulting in abnormal peris talsis/motility; kidney: concentric subendothelial intimal 
proliferation affecting small arcuate and interlobular arteries with resultant reduced renal cortical blood flow, 
results in proteinuria, azotemia and hypertension; skeletal muscle: atrophy, interstitial fibrosis; 
20 inflammation; lung: interstitial pneumonitis and interstitial fibrosis; and heart: contraction band necrosis, 
scarring/fibrosis. 

Autoimmune or Immune-mediated Skin Disease including Bullous Skin Diseases, Erythema 
Multiforme, and Contact Dermatitis are mediated by auto-antibodies, the genesis of which is T lymphocyte- 
dependent. 

25 Psoriasis is proposed to be a T lymphocyte-mediated inflammatory disease. Lesions contain 

infiltrates of T lymphocytes, macrophages and antigen processing cells, and some neutrophils. 

Transplantation associated diseases, including Graft rejection and Graft-Versus-Host-Disease 
(GVHD) are T lymphocyte-dependent; inhibition of T lymphocyte function is ameliorative. 

The compounds of the present invention, e.g., polypeptides or antibodies, are administered to a 
30 mammal, preferably a human, in accord with known methods, such as intravenous administration as a bolus 
or by continuous infusion over a period of time, by intramuscular, intraperitoneal, intracerobrospinal, 
subcutaneous, intra-articular, intrasynovial, intrathecal, oral, topical, or inhalation (intranasal, 
intrapulmonary) routes. Intravenous or inhaled administration of polypeptides and antibodies is preferred. 

In immunoadjuvant therapy, other therapeutic regimens, such administration of an anti-cancer 
35 agent, may be combined with the administration of the proteins, antibodies or compounds of the instant 
invention. For example, the patient to be treated with a the immunoadjuvant of the invention may also 
receive an anti-cancer agent (chemotherapeutic agent) or radiation therapy. Preparation and dosing 
schedules for such chemotherapeutic agents may be used according to manufacturers' instructions or as 
determined empirically by the skilled practitioner. Preparation and dosing schedules for such chemotherapy 
40 are also described in Chemotherapy Service Ed., M.C. Perry, Williams & Wilkins, Baltimore, MD (1992). 
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The chemotherapeutic agent may precede, or follow administration of the immunoadjuvant or may be given 
simultaneously therewith. Additionally, an anti-estrogen compound such as tamoxifen or an anti- 
progesterone such as onapristone (see, EP 616812) may be given in dosages known for such molecules. 

It may be desirable to also administer antibodies against other immune disease associated or tumor 
5 associated antigens, such as antibodies which bind to CD20, CD 11a, CD 18, ErbB2, EGFR, ErbB3, ErbB4, 
or vascular endothelial factor (VEGF). Alternatively, or in addition, tv/o or more antibodies binding the 
same or two or more different antigens disclosed herein may be coadministered to the patient Sometimes, it 
may be beneficial to also administer one or more cytokines to the patient. In one embodiment, the PRO 
polypeptides are coadministered with a growth inhibitory agent. For example, the growth inhibitory agent 
10 may be administered first, followed by a PRO polypeptide. However, simultaneous administration or 
administration first is also contemplated. Suitable dosages for the growth inhibitory agent are those 
presently used and may be lowered due to the combined action (synergy) of the growth inhibitory agent and 
the PRO polypeptide. \ 

For the treatment or reduction in the severity of immune related disease, the appropriate dosage of 
15 an a compound of the invention will depend on the type of disease to be treated, as defined above, the 
severity and course of the disease, whether the agent is administered for preventive or therapeutic purposes, 
previous therapy, the patient's clinical history and response to the compound, and the discretion of the 
attending physician. The compound is suitably administered to the patient at one time or over a series of 
treatments. 

20 For example, depending on the type and severity of the disease, about 1 |ig/kg to 15 mg/kg (e.g., 

0.1-20 mg/kg) of polypeptide or antibody is an initial candidate dosage for administration to the patient, 
whether, for example, by one or more separate administrations, or by continuous infusion. A typical daily 
dosage might range from about 1 }ig/kg to 100 mg/kg or more, depending on the factors mentioned above. 
For repeated administrations over several days or longer, depending on the condition, the treatment is 

25 sustained until a desired suppression of disease symptoms occurs. However, other dosage regimens may be 
useful. The progress of this therapy is easily monitored by conventional techniques and assays. 
O. Articles of Manufacture 

In another embodiment of the invention, an article of manufacture containing materials (e.g., 
comprising a PRO molecule) useful for the diagnosis or treatment of the disorders described above is 

30 provided. The article of manufacture comprises a container and an instruction. Suitable containers include, 
for example, bottles, vials, syringes, and test tubes. The containers may be formed from a variety of 
materials such as glass or plastic. The container holds a composition which is effective for diagnosing or 
treating the condition and may have a sterile access port (for example the container may be an intravenous 
solution bag or a vial having a stopper pierceable by a hypodermic injection needle). The active agent in the 

35 composition is usually a polypeptide or an antibody of the invention. An instruction or label on, or 
associated with, the container indicates that the composition is used for diagnosing or treating the condition 
of choice. The article of manufacture may further comprise a second container comprising a 
pharmaceutically-acceptable buffer, such as phosphate-buffered saline, Ringer's solution and dextrose 
solution. It may further include other materials desirable from a commercial and user standpoint, including 

40 other buffers, diluents, filters, needles, syringes, and package inserts with instructions for use. 
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P. Diagnosis and Prognosis of Immun e Related Disease 

Cell surface proteins, such as proteins which are overexpressed in immune related disease, are 
excellent targets for drug candidates or disease treatment. The same proteins along with secreted proteins 
encoded by the genes amplified in immune related disease find additional use in the diagnosis and prognosis 

5 of this disease. For example, antibodies directed against the protein products of genes amplified in immune 
related disease, can be used as diagnostics or prognostics. 

For example, antibodies, including antibody fragments, can be used to qualitatively or 
quantitatively detect the expression of proteins encoded by amplified or overexpressed genes ("marker gene 
products")- The antibody preferably is equipped with a detectable, e.g., fluorescent label, and binding can be 

10 monitored by light microscopy, flow cytometry, fluorimetry, or other techniques known in the art. These 
techniques are particularly suitable, if the overexpressed gene encodes a cell surface protein. Such binding 
assays are performed essentially as described above. 

In situ detection of antibody binding to the marker gene products can be performed, for example, by 
immunofluorescence or immunoelectron microscopy. For this purpose, a histological specimen is removed 

15 from the patient, and a labeled antibody is applied to it, preferably by overlaying the antibody on a biological 
sample. This procedure also allows for determining the distribution of the marker gene product in the tissue 
examined. It will be apparent for those skilled in the art that a wide variety of histological methods are 

readily available for in situ detection. 

The following examples are offered for illustrative purposes only, and are not intended to limit the 

20 scope of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by 

reference in their entirety. 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
25 instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, 
Manassas, VA. 

EXAMPLE 1: Microarrav analysis 

30 Skin biopsies from psoriatic patients and from healthy donors ( henceforth, "normal skin") were 

obtained. For each psoriatic patient, skin samples were taken from lesional and non-lesional sites, in order 
to identify disease specific genes which are differentially expressed in psoriatic tissue. All of the psoriatic 
skin samples were analyzed for Keratin 16 staining via immunohistochemistry and epidermal thickness. All 
samples were stored at -70 °C until ready for RNA isolation. The skin biopsies were homogenized in 600|il 

35 of RLT buffer (+ BME) and RNA was isolated using Qiagen™ Rneasy Mini columns ( Qiagen ) with on- 
column DNase treatment following the manufacturer's guidelines. Following RNA isolation, RNA was 
quantitated using RiboGreen™ (Molecular Probes ) following the manufacturer's guidelines and checked on 
agarose gels for integrity. The RNA yields ranged from 19 to 54 fig for psoriatic lesional skin, 7.7 to 24 \ig 
for non-lesional matched control skin and 5.4 to lOjLig for normal skin. 4u.g of RNA was labeled for 

40 microarray analysis and samples were run on proprietary Genentech microarray and Affymetrics 
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microarrays. Genes were compared whose expression was upregulated or downregulated in psoritic skin vs 
non-lesional skin, thus comparing expression profiles of non-iesional skin and psoriatic skin from the same 
patient, and also comparing against normal skin biopsies of normal healthy donors as a further control. 

To test the PRO polypeptides and nucleic acids of the invention for their role in Inflammatory 

5 Bowel Disease, a microarray assay was used to find genes that are overexpressed in IBD as compared to 
normal bowel tissue. Biopsies from patients with IBD were obtained. For each IBD patient, samples were 
taken from disease (either ulcerative colitis (UC) or Crohn* s) tissue and from healthy bowel, so that 
expression patterns could be better compared. All samples were stored at -70°C until ready for RNA 
isolation. The biopsies were homogenized in 600^1 of RLT buffer (+ BME) and RNA was isolated using 

10 Qiagen™ Rneasy Mini columns ( Qiagen ) with on-column DNase treatment following the manufacturer's 
guidelines. Following RNA isolation, RNA was quantitated using RiboGreen™ (Molecular Probes ) 
following the manufacturer's guidelines and checked on agarose gels for integrity. Appropriate amounts of 
RNA were labeled for microarray analysis and samples were run on proprietary Genentech microarray and 
Affymetrics™ microarrays. Genes were compared whose expression was upregulated in UC or Crohn's 

15 tissue vs normal bowel, matching biopsies from normal bowel and disease tissue from the same patient. 

In an experiment to test the expression of the PRO polypeptides and associated nucleotides of the 
invention upon co-stimulation, CD4+ T cells were purified from a single donor using the RossetteSep™ 
protocol from (Stem Cell Technologies, Vancouver BC) which contains anti-CD8, anti-CD 16, anti-CD 19, 
anti-CD36 and anti-CD56 antibodies used to produce a population of isolated CD4 + T cells. Isolated CD4+ 

20 T cells were activated with an anti-CD3 antibody (used at a concentration that does not stimulate 
proliferation) together with either ICAM-1 or anti-CD28 antibody. At 24 or 72 hours cells were harvested, 
RNA extracted and analysis run on Affimax™ (Affymetrix Inc. Santa Clara, CA) microarrays. Non- 
stimulated (resting) cells were harvested immediately after purification, and subjected to the same analysis. 
Genes were compared whose expression was upregulated at either of the two timepoints in activated vs. 

25 resting cells. 

To examine the expression of the PRO polypeptides and associated nucleic acids for Rheumatoid 
Arthritis (RA), a proprietary database containing gene expression information (GeneExpress®, Gene Logic 
Inc., Gaithersburg, MD) was analyzed to identify polypeptides (and their encoding nucleic acids) whose 
expression is significantly upregulated in RA as compared to normal tissues. Specifically, analysis of the 

30 GeneExpress® database was conducted using either software available through Gene Logic Inc., 
Gaithersburg, MD, for use with the GeneExpress® database or with proprietary software written and 
developed at Genentech, Inc. for use with the GeneExpress® database. The rating of positive hits in the 
analysis is based upon several criteria including, for example; expression level, tissue specificity, and 
expression level in normal essential and/or normal proliferating tissues. The following is a list of molecules 

35 whose tissue expression profile as determined from an analysis of the GeneExpress® database evidences 
high tissue expression and significant upregulation of expression in PBMC from twelve RA patients as 
compared to normal PBMC taken from twenty-five healthy individuals and optionally relatively low 
expression in normal essential and/or normal proliferating tissues. As such, the molecules listed below are 
excellent polypeptide targets for the diagnosis and alleviation of RA in mammals. 

40 
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The conclusion of these experiments for the nucleic acids and encoded PRO polypeptides of 
Figures 1-64 are summarized below. 



DNA275Q62 ) Figure 1 (SEP ID NO: 1) PRQ62782) Figure 2 (SEP ID NP:2) 
5 DNA275062 is down-regulated 2 fold in colon samples from Crohn's disease and ulcerative colitis patients 
as compared to normal colon. It is also down-regulated 3 fold upon activation of CD4 T cells with CD28 or 
ICAM. 

DNA226359 ) Figure 3 (SEP ID NO:3PRQ36822 ) Figure 4 (SEP ID NP:4) 
10 DNA226359 is down-regulated 2 fold in colon samples from Crohn's disease and ulcerative colitis patients 
as compared to normal colon. It is also down-regulated 3 fold upon activation of CD4 T cells with CD28 or 
ICAM, and down-regulated 4 fold upon differentiation of monocytes into macrophages after 7 days in 
differentation media. 

15 DNA226763 Figure 5 (SEP ID NP:5) PRP37226 Figure 6 (SEP ID NP:6) 

DNA226763 is up-regulated 2 fold in colon samples from ulcerative colitis patients as compared to normal 
colon. It is also up-regulated 2 fold in memory T cells as compared ,o naive CD4 T cells, and up-regulated 6 
fold upon differentiation of monocytes into macrophages after 1 day in differentation media or activation 
with LPS. 

20 

DNA304716 Figure 7 (SEP ID NP:7) PRP71142 Figure 8 (SEP ID NP:8) 

DNA304716 is up-regulated 1.5 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and up-regulated 2 fold in colon samples from ulcerative colitis patients as compared to normal 
colon. It is also up-regulated 2 fold in CD4 T cells activated with CD28 and ICAM, up-regulated 8 fold 
25 upon differentiation of monocytes into macrophages after 1 day in differentation media, up-regulated 2 fold 
upon activation of monocytes with LPS and up-regulated 2 fold upon activation of dendritic cells with LPS. 

DNA228014 Figure 9 (SEP ID NP:9) PRQ38477 Figure 10 (SEP ID NO: 10) 

DNA228014 is up-regulated 2 fold in colon samples from ulcerative colitis patients as compared to normal 
30 colon and down-regulated 2 fold in white blood cells from rheumatoid arthritis patients as compared to those 
from normal donors. It is also down-regulated 3 fold upon activation of CD4 T cells with CD28 or ICAM, 
and down-regulated 3000 fold upon differentiation of monocytes into macrophages after 1 day in 
differentation media. 

35 DNA143498 Figure 1 1 (SEP ID NO: 11) PRO 10275 Figu re 12 (SEP ID NO: 12) 

DNA143498 is up-regulated 1.5 fold in colon samples from Crohn's disease patients as compared to normal 
colon. It is also up-regulated 1.5 fold upon activation of CD4 T cells with CD28 or ICAM and up-regulated 
2 fold upon differentiation of monocytes into macrophages after 1 day in differentation media. 

40 DNA324792 Figure 13 (SEP ID NP:13) PRP814Q7 Figu reU (SEP ID NO: 14) 

DNA324792 is up-regulated 2 fold in white blood cells from rheumatoid arthritis patients as compared to 
those from normal donors. It is also up-regulated 1.5 fold upon activation of CD4 T cells with CD28 or 
ICAM and up-regulated 1.3 fold upon differentiation of monocytes into macrophages after 1 day in 
differentiation media and up-regulated 2 fold upon activation of both monocytes and dendritic cells with 

45 LPS. 

DNA287319 Figurel5 (SEP ID NP:15) PRQ69584 Figure 16 (SEP ID NO:16) 

DNA287319 is up-regulated 2 fold in colon samples from ulcerative colitis patients and 1.5 fold in colon 
samples from Crohn's disease patients as compared to normal colon. It is also up-regulated 1.5 fold upon 
50 activation of CD4 T cells with CD28 or ICAM. 

DNA333620 Figure 17 (SEP ID NO: 17) PRQ88263 Figure 1 8 (SEP ID NO: 18) 

DNA333620 is up-regulated 1.3 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients, up-regulated 1.5 fold in colon samples from Crohn's disease patients as compared to normal colon 
55 and up-regulated 1.3 fold in white blood cells from rheumatoid arthritis patients as compared to those from 
normal donors. It is also down-regulated 2 fold in NK cells upon activation with IL15 or IL12, down- 
regulated 3 fold upon differentiation of monocytes into macrophages after 1 day in differentiation media and 
up-regulated 8 fold upon activation of dendritic cells with LPS. 
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DNA324275 Figure 19 (SEP ID NO: 19) PRP8Q958 Figure 20 (SEP ID NQ:2Q) 

DNA324275 is up-regulated 1.8 fold in colon samples from Crohn's disease patients as compared to normal 
colon. It is also up-regulated 2 fold upon activation of CD4 T cells with CD28 or ICAM. 

DNA227952 Fig ure 21 (SEP ID NO:2n PRP 38415 Figure 22 (SEP ID NP:22) 

DNA227952 is up-regulated 2 fold in colon samples from ulcerative colitis patients and 1.2 fold in colon 
samples from Crohn's disease patients as compared to normal colon. It is also up-regulated 2 fold upon 
activation of CD4 T cells with CD28 or ICAM and up-regulated 2.5 fold upon activation of B cells with IL4. 

DNA103507 Figure 23 (SEP ID NO:21) PRP 4834 Figure 24 (SEP ID NO:24) 

DNA 103507 is up-regulated 1.3 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and up-regulated 1.8 fold in colon samples from ulcerative colitis patients as compared to normal 
colon. It is also up-regulated 3 fold in dendritic cells upon activation with LPS. 



DNA226256 Figure 25 (SEP ID NP:25> PRQ36719 Figure 26 (SEP ID NO:26) 
DNA226256 is up-regulated 1.3 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and up-regulated 1.5 fold in colon samples from ulcerative colitis patients and 2 fold in colon 
samples from Crohn's disease patients as compared to normal colon. It is also down-regulated 2 fold upon 
20 activation of CD4 T cells with CD28 or ICAM and down-regulated 4 fold upon differentiation of monocytes 
into macrophages after 1 day in differentiation media or upon activation with LPS. 

DNA324799 Figure 27 (SEP ID NO:27^ PRP 8 141 4 Figure 28 (SEP ID NP:28) 

DNA324799 is up-regulated 1.2 fold in lesional skin as compared to non-lesional skin from psoriasis 
25 patients and up-regulated 1.5 fold in colon samples from ulcerative colitis patients as compared to normal 
colon. It is also down-regulated 2 fold upon differentiation of monocytes into macrophages after 7 days in 
differentiation media and up-regulated 4 fold in dendritic cells upon activation with LPS. 

DNA255836 Figure 29 (SEP ID NO:29^> PRO50891 Figure 30 (SEP ID NP:30) 
30 DNA255836 is up-regulated 2 fold in lesional skin as compared to non-lesional skin from psoriasis patients 
and up-regulated 2.2 fold in colon samples from ulcerative colitis patients as compared to normal colon. It is 
also up-regulated 1.8 fold in B cells upon activation with IL4, down-regulated 2 fold upon differentiation of 
monocytes into macrophages after 7 days in differentiation media and up-regulated 3 fold in dendritic cells 
upon activation with LPS. 

35 

DNA339972 Figure 31 (SEP ID NP:3D PRQ91480 Figur e 32 (SEP ID NP:32) 

DNA339972 is down-regulated 1.8 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and down-regulated 1.5 fold in colon samples from Crohn's disease patients as compared to normal 
colon. It is also down-regulated 2 fold upon differentiation of monocytes into macrophages after 7 days in 
40 differentiation media. 

DNA323910 Figure 33 (SEP ID NP:33) PRP80648 Fig ure 34 (SEP ID NP:34) 

DNA323910 is up-regulated 4 fold in lesional skin as compared to non-lesional skin from psoriasis patients 
and down-regulated 1.5 fold in white blood cells from rheumatoid arthritis patients as compared to those 
45 from normal donors. It is also down-regulated 10 fold upon activation of CD4 T cells with CD28 or ICAM, 
down-regulated 10 fold upon differentiation of monocytes into macrophages after 7 days in differentiation 
media and up-regulated 2 fold upon activation of monocytes with LPS. 

DNA226272 Figure 35 (SEP ID NP:35^ PRP36735 Figur e 36 (SEP ID NP:36) 
50 DNA226272 is up-regulated 1.5 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and up-regulated 1.8 fold in white blood cells from rheumatoid arthritis patients as compared to 
those from normal donors. It is also up-regulated 1.5 fold upon activation of CD4 T cells with CD28 or 
ICAM and down-regulated 2.5 fold upon differentiation of monocytes into macrophages after I day in 
differentiation media. 



DNA151772 Figure 37 (SEP ID NP:37^ PRP12050 Figure 38 (SEP ID NP:38) 

DNA151772 is up-regulated 1.2 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and up-regulated 2 fold in colon samples from Crohn's disease patients as compared to normal 
colon It is also up-regulated 2 fold upon activation of CD4 T cells with CD28 or ICAM, up-regulated 2 fold 
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upon differentiation of monocytes into macrophages after 7 days in differentiation media and up-regulated 
2.5 fold upon activation of monocytes and dendritic cells with LPS. 

DNA327933 Figure 39 fSEO ID NO:39^ PRQ839Q3 Figure 40 (SEP ID NO:40) 
5 DNA327983 is up-regulated 1.5 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients, up-regulated 1.3 fold in colon samples from Crohn's disease patients as compared to normal colon 
and up-regulated 1 .3 fold in white blood cells from rheumatoid arthritis patients as compared to those from 
normal donors. It is also down-regulated 1 .5 fold upon activation of CD4 T cells with CD28 or ICAM and 
down-regulated 2 fold upon activation of monocytes with LPS. 

10 

DNA227046 Figure 41 (SEP ID NO:4D PRQ37509 Figure 42 (SEP ID NO:42) 

DNA227046 is down-regulated 1.8 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients, down-regulated 1.3 fold in colon samples from both ulcerative colitis and Crohn's disease patients 
as compared to normal colon and down-regulated 2 fold in white blood cells from rheumatoid arthritis 
1 5 patients as compared to those from normal donors. It is also down-regulated 2 fold upon activation of CD4 
T cells with CD28 or ICAM, down-regulated 1 .5 fold in B cells upon activation with IL4 and down- 
regulated 4 fold upon differentiation of monocytes into macrophages after 7 days in differentiation media. 

DNA227297 Figure 43 (SEP ID NP:43) PRP37760 Figu re 44 (SEP ID NP:44^ \ 
20 DNA227297 is up-regulated 1.5 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and up-regulated 1.3 fold in white blood cells from rheumatoid arthritis patients as compared to 
those from normal donors. It is also up-regulated 1.3 fold in both monocytes and dendritic cells upon 
activation with LPS. 

25 DNA340442 Figure 45 (SEP ID NP:45) PRP92173 Figu re 46 (SEP ID NP:46) . 
DNA340442 is up-regulated 1.5 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and 2.5 fold in white blood cells from rheumatoid arthritis patients as compared to those from 
normal donors. It is also down-regulated 2.2 fold upon activation of CD4 T cells with CD28 or ICAM and 
down-regulated 12 fold upon differentiation of monocytes into macrophages after 7 days in differentiation 

30 media. 

DNA324641 Figure 47 (SEP ID NP:47) PRP1Q849 Figu re 48 (SEP ID NP:48) 

DNA324641 is down-regulated 1.5 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients, up-regulated 1.5 fold in colon samples from both ulcerative colitis and Crohn's disease patients as 
35 compared to normal colon and down-regulated 2 fold in white blood cells from rheumatoid arthritis patients 
as compared to those from normal donors. It is also down-regulated 2 fold upon activation of CD4 T cells 
with CD28 or ICAM, down-regulated 2 fold upon activation of NK cells with IL15 or IL2 and down- 
regulated 3 fold upon differentiation of monocytes into macrophages after 7 days in differentiation media. 

40 DNA275257 Figure 49 (SEP ID NP:49> PRP62943 Figu re. 50 (SEP ID NP:50) 

DNA275257 is up-regulated 4 fold in lesional skin as compared to non-lesional skin from psoriasis patients 
and up-regulated 2 fold in white blood cells from rheumatoid arthritis patients as compared to those from 
normal donors. It is also down-regulated 2 fold upon activation of CD4 T cells with CD28 or ICAM, down- 
regulated 6 fold upon differentiation of monocytes into macrophages after 7 days in differentiation media 

45 and up-regulated 4 fold upon activation of monocytes with LPS. 

DNA331909 Figure 51 (SEP ID NP:51) PRP86795 Figu re. V. (SEP ID NP:52) 

DNA331909 is up-regulated 2.5 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients, up-regulated 10 fold in colon samples from ulcerative colitis patients as compared to normal colon 
50 and up-regulated 1.5 fold in white blood cells from rheumatoid arthritis patients as compared to those from 
normal donors. It is also down-regulated 9 fold upon activation of CD4 T cells with CD28 or ICAM, down- 
regulated 2 fold upon activation of NK cells with IL15 or IL2 and down-regulated 4 fold upon differentiation 
of monocytes into macrophages after 7 days in differentiation media. 

55 DNA2261 1 1 Figure 53 (SEP ID NP:53) PRP36574 Figure 54 (SEP ID NP:54) 

DNA226111 is up-regulated 2.5 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients. It is also up-regulated 8 fold upon differentiation of monocytes into macrophages after 7 days in 
differentiation media. 
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DNA287270 Figure 55 (SEP ID NP:55^ PR069541 Figure 56 (SEP ID NO:5ff> 

DNA287270 is up-regulated 1.2 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients, up-regulated 1.2 fold in colon samples from Crohn's disease patients and up-regulated 1.8 fold in 
ulcerative colitis patients as compared to normal colon. It is also up-regulated 1.8 fold upon activation of 
5 CD4 T cells with CD28 or ICAM and up-regulated 2 fold upon activation of NK cells with IL15 or IL2. 

DNA329369 F i gure 57 (SEP ID NP:57^ PRQ84948 Figure 58 (SEQ ID NP:58) 

DNA329369 is up-regulated 2 fold in lesional skin as compared to non-lesional skin from psoriasis patients, 
up-regulated 1.5 fold in colon samples from Crohn's disease patients as compared to normal colon and up- 
10 regulated 1.5 fold in white blood cells from rheumatoid arthritis patients as compared to those from normal 
donors. It is also up-regulated 1.5 fold upon differentiation of monocytes into macrophages after 1 day in 
differentiation media and up-regulated 1.5 fold upon activation of monocytes with LPS. 

DNA33Q645 Figure 59 fSEP ID NP:59) PRQ85817 Figure 60 (SEP ID NP:60) 
15 DNA330645 is up-regulated 1.8 fold in lesional skin as compared to non-lesional skin from psoriasis 
patients and up-regulated 2 fold in white blood cells from rheumatoid arthritis patients as compared to those 
from normal donors. It is also up-regulated 2.3 fold upon activation of CD4 T cells with CD28 or ICAM 
and up-regulated 2 fold upon activation of monocytes and 4 fold upon activation of dendritic cells with LPS. 

20 DNA227908 Figure 61 (SEP ID NP:6D PRQ38371 Figure 62 (SEP ID NO:62) 

DNA227908 is up-regulated 2.5 fold in white blood cells from rheumatoid arthritis patients as compared to 
those from normal donors. It is also up-regulated 2.5 fold upon differentiation of monocytes into 
macrophages after 1 day in differentiation media. 

25 DNA340375 Figure 63 (SEP ID NP:63) PRP90951 Figu re 64 (SEP ID NP:64) 

DNA340375 expression increases 100 fold in both Crohn's and Ulcerative coloitis colon samples versus 
normal colon. Expression is highest in B cells and T cells. In B cells, expression decreases 1.5 fold on 
activation with DL-4. In T cells, expression decreases 2 fold upon activation with CD28 or ICAM. 

30 

EXAMPLE 2: Use of PRP as a hybridization probe 

The following method describes use of a nucleotide sequence encoding PRP as a hybridization 

probe. 

DNA comprising the coding sequence of full-length or mature PRP as disclosed herein is employed 
35 as a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants of PRP) in 
human tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the 
following high stringency conditions. Hybridization of radiolabeled PRP-derived probe to the filters is 
performed in a solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM 
40 sodium phosphate, pH 6.8, 2x Denhardfs solution, and 10% dextran sulfate at 42°C for 20 hours. Washing 
of the filters is performed in an aqueous solution of O.lx SSC and 0.1% SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRP 
can then be identified using standard techniques known in the art. 



45 EXAMPLE 3: Expression of PRP in E. coli 

This example illustrates preparation of an unglycosylated form of PRP by recombinant expression 

in E. coli. 

The DNA sequence encoding PRP is initially amplified using selected PCR primers. The primers 
should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
50 expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 

77 



WO 2004/081199 



PCT/US2004/007862 



pBR322 (derived from E. colt; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which 
encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, 
5 polyhis sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, 
and an argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described 
in Sambrook et al., supra . Transformants are identified by their ability to grow on LB plates and antibiotic 
resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and 
10 DNA sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented 
with antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The 
cells are then grown to a desired optical density, during which the expression promoter is turned on. 

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The 
15 cell pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the 
solubilized PRO protein can then be purified using a metal chelating column under conditions that allow 

tight binding of the protein. 

PRO may be expressed in E, coli in a poly-His tagged form, using the following procedure. The 
DNA encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction 

20 enzyme sites which correspond to the restriction enzyme sites on the selected expression vector, and other 
useful sequences providing for efficient and reliable translation initiation, rapid purification on a metal 
chelation column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged 
sequences are then ligated into an expression vector, which is used to transform an E. coli host based on 
strain 52 (W3110 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transformants are first grown in LB 

25 containing 50 mg/ml carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are 
then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium 
citrate*2H20, 1.07 g KC1, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 
110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgS0 4 ) and grown for approximately 20-30 hours 
at 30°C with shaking. Samples are removed to verify expression by SDS-PAGE analysis, and the bulk 

30 culture is centrifuged to pellet the cells. Cell pellets are frozen until purification and refolding. 

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 
M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0.1M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step 
results in a denatured protein with all cysteine residues blocked by suifitolization. The solution is centrifuged 

35 at 40,000 rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of 
metal chelate column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to 
clarify. The clarified extract is loaded onto a 5 mi Qiagen Ni-NTA metal chelate column equilibrated in the 
metal chelate column buffer. The column is washed with additional buffer containing 50 mM imidazole 
(Calbiochem, Utrol grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. 
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Fractions containing the desired protein are pooled and stored at 4°C. Protein concentration is estimated by 
its absorbance at 280 nm using the calculated extinction coefficient based on its amino acid sequence. 

The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer 
consisting of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM 
5 EDTA. Refolding volumes are chosen so that the final protein concentration is between 50 to 100 
micrograms/ml. The refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is 
quenched by the addition of TFA to a final concentration of 0.4% (pH of approximately 3). Before further 
purification of the protein, the solution is filtered through a 0.22 micron filter and acetonitrile is added to 
2-10% final concentration. The refolded protein is chromatographed on a Poros Rl/H reversed phase 

10 column using a mobile buffer of 0.1% TFA with elution with a gradient of acetonitrile from 10 to 80%. 
Aliquots of fractions with A280 absorbance are analyzed on SDS polyacrylamide gels and fractions 
containing homogeneous refolded protein are pooled. Generally, the properly refolded species of most 
proteins are eluted at the lowest concentrations of acetonitrile since those species are the most compact with 
their hydrophobic interiors shielded from interaction with the reversed phase resin. Aggregated species are 

15 usually eluted at higher acetonitrile concentrations. In addition to resolving misfolded forms of proteins 
from the desired form, the reversed phase step also removes endotoxin from the samples. 

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed 
using a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 
with 0.14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine 

20 (Pharmacia) resins equilibrated in the formulation buffer and sterile filtered. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 4: Expression of PRO in mammalian cells 

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant 
25 expression in mammalian cells. 

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression 
vector. Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion 
of the PRO DNA using ligation methods such as described in Sambrook et al, supra . The resulting vector is 
called pRK5-PRO. 

30 In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) 

are grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf 
serum and optionally, nutrient components and/or antibiotics. About 10 \ig pRK5-PRO DNA is mixed with 
about 1 [ig DNA encoding the VA RNA gene [Thimmappaya et al„ CeH, 31:543 (1982)] and dissolved in 
500 \i\ of 1 mM Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl 2 . To this mixture is added, dropwise, 500 u.1 of 50 

35 mM HEPES (pH 7.35), 280 mM NaCl, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes 
at 25°C. The precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 
37°C. The culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 
293 cells are then washed with serum free medium, fresh medium is added and the cells are incubated for 
about 5 days. 
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Approximately 24 hours after the transfections, the culture medium is removed and replaced with 
culture medium (alone) or culture medium containing 200 nCi/ml 35 S-cysteine and 200 [iCi/ml 35 S- 
methionine. After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, 
and loaded onto a 15% SDS gel. The processed gel may be dried and exposed to film for a selected period 
5 of time to reveal the presence of PRO polypeptide. The cultures containing transfected cells may undergo 
further incubation (in serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran 
sulfate method described by Somparyrac et al„ Proc. Natl. Acad. ScL 12:7575 (1981). 293 cells are grown 
to maximal density in a spinner flask and 700 \ig pRK5-PRO DNA is added. The cells are first concentrated 
10 from the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated 
on the cell pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue 
culture medium, and re-introduced into the spinner flask containing tissue culture medium, 5 u-g/ml bovine 
insu]in and 0.1 {xg/ml bovine transferrin. After about four days, the conditioned media is centrifuged and 
filtered to remove cells and debris. The sample containing expressed PRO can then be concentrated and 
15 purified by any selected method, such as dialysis and/or column chromatography. 

In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected 
into CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the cell 
cultures can be incubated, and the medium replaced with culture medium (alone) or medium containing a 
radiolabel such as 35 S-methionine. After determining the presence of PRO polypeptide, the culture medium 
20 may be replaced with serum free medium. Preferably, the cultures are incubated for about 6 days, and then 
the conditioned medium is harvested. The medium containing the expressed PRO can then be concentrated 
and purified by any selected method. 

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of 
the pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as 
25 a poly-his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned 
into a SV40 promoter/enhancer containing vector containing a selection marker such as DHFR for selection 
of stable clones. Finally, the CHO cells can be transfected (as described above) with the SV40 
promoter/enhancer containing vector. Labeling may be performed, as described above, to verify expression. 
The culture medium containing the expressed poiy-His tagged PRO can then be concentrated and purified 
30 by any selected method, such as by Ni 2+ -chelate affinity chromatography. 

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in 
CHO cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are 
expressed as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. 
35 extracellular domains) of the respective proteins are fused to an IgGl constant region sequence containing 
the hinge, CH2 and CH2 domains and/or is a poly-His tagged form. 

Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector 
using standard techniques as described in Ausubel et al, Current Protocols of Mo lecular Biology, Unit 3.16, 
John Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' 
40 and 3' of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in 
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CHO cells is as described in Lucas et al„ Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 
promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO 

5 cells using commercially available transfection reagents Superfect® (Quiagen), Dosper® or Fugene® 
(Boehringer Mannheim). The cells are grown as described in Lucas et al., sunn*. Approximately 3 x 10" 
cells are frozen in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vortexing. The contents are pipetted into a centrifuge tube containing 10 mL of media and centrifuged at 

10 1000 rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective 
media (0.2 |im filtered PS20 with 5% 0.2 Jim diafiltered fetal bovine serum). The cells are then aliquoted 
into a 100 mL spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 
250 mL spinner filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 
250 mL, 500 mL and 2000 mL spinners are seeded with 3 x 10 5 cells/mL. The cell media is exchanged with 

15 fresh media by centrifugation and resuspension in production medium. Although any suitable CHO media 
may be employed, a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may 
actually be used. A 3L production spinner is seeded at 1.2 x 10 6 cells/mL. On day 0, pH is determined. On 
day 1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, 
the temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% 

20 polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the 
production, the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability 
dropped below 70%, the cell culture is harvested by centrifugation and filtering through a 0.22 |im filter. 
The filtrate was either stored at 4°C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). 

25 Before purification, imidazole is added to the conditioned media to a concentration of 5 mM. The 
conditioned media is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer 
containing 0.3 M NaCl and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column 
is washed with additional equilibration buffer and the protein eluted with equilibration buffer containing 
0.25 M imidazole. The highly purified protein is subsequently desalted into a storage buffer containing 10 

30 mM Hepes, 0.14 M NaCl and 4% mannitoi, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and 
stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 
20 mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration 
35 buffer before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by 
collecting 1 ml fractions into tubes containing 275 Jil of 1 M Tris buffer, pH 9. The highly purified protein 
is subsequently desalted into storage buffer as described above for the poly-His tagged proteins. The 
homogeneity is assessed by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman 
degradation. 

40 Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 
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RX AMPLE 5: Expression of PRO in Yeast 

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from 
5 the ADH2/GAPDH promoter. DNA encoding PRO and the promoter is inserted into suitable restriction 
enzyme sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding 
PRO can be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a 
native PRO signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or 
invertase secretory signal/leader sequence, and linker sequences (if needed) for expression of PRO. 
10 Yeast cells, such as yeast strain AB110, can then be transformed with the expression plasmids 

described above and cultured in selected fermentation media. The transformed yeast supernatants can be 
analyzed by precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining 

of the gels with Coomassie Bltje stain. 

Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
15 fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. 
The concentrate containing PRO may further be purified using selected column chromatography resins. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 6: Expression of PRO in Baculovirus -Infected Insect Cells 
20 The following method describes recombinant expression of PRO in Baculovirus-infected insect 

cells. 

The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of 
IgG). A variety of plasmids may be employed, including plasmids derived from commercially available 
25 plasmids such as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the 
coding sequence of PRO such as the sequence encoding the extracellular domain of a transmembrane protein 
or the sequence encoding the mature protein if the protein is extracellular is amplified by PCR with primers 
complementary to the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction 
enzyme sites. The product is then digested with those selected restriction enzymes and subcloned into the 

30 expression vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold 
virus DNA (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin 
(commercially available from GIBCO-BRL). After 4 - 5 days of incubation at 28°C, the released viruses are 
' harvested and used for further amplifications. Viral infection and protein expression are performed as 
35 described by O'Reilley et al., Baculovirus expression vec tors: A Laboratory Manual, Oxford: Oxford 
University Press (1994). 

Expressed poly-his tagged PRO can then be purified, for example, by Ni 2+ -chelate affinity 
chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 
Rupert et al., Nature , 362:175-179 (1993). Briefly, S£9 cells are washed, resuspended in sonication buffer 
40 (25 mL Hepes, pH 7.9; 12.5 mM MgCl 2 ; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KC1), and 
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sonicated twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is 
diluted 50-fold in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered 
through a 0.45 p.m filter. A Ni 2+ -NTA agarose column (commercially available from Qiagen) is prepared 
with a bed volume of 5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. 
The fdtered cell extract is loaded onto the column at 0.5 mL per minute. The column is washed to baseline 
A 280 with loading buffer, at which point fraction collection is started. Next, the column is washed with a 
secondary wash buffer (50 mM phosphate; 300 mM Nad, 10% glycerol, pH 6.0), which elutes 
nonspecifically bound protein. After reaching A 280 baseline again, the column is developed with a 0 to 500 
mM Imidazole gradient in the secondary wash buffer. One mL fractions are collected and analyzed by SDS- 
PAGE and silver staining or Western blot with Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). 
Fractions containing the eluted His I0 -tagged PRO are pooled and dialyzed against loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 
chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



EXAMPLE 7: Preparation of Antibodies that Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 
Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra. Immunogens that may be employed include purified PRO, fusion proteins 
containing PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can 
be made by the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. 
Alternatively, the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, 
Hamilton, MT) and injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 
days later with additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the 
mice may also be boosted with additional immunization injections. Serum samples may be periodically 
obtained from the mice by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be 
injected with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the 
spleen cells are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected 
murine myeloma cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions 
generate hybridoma cells which can then be plated in 96 well tissue culture plates containing HAT 
(hypoxanthine, aminopterin, and thymidine) medium to inhibit proliferation of non-fused cells, myeloma 
35 hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of 
"positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the 
art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to 
40 produce ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be 
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grown in tissue culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the 
ascites can be accomplished using ammonium sulfate precipitation, followed by gel exclusion 
chromatography. Alternatively, affinity chromatography based upon binding of antibody to protein A or 
protein G can be employed. 

5 

EXAMPLE 8: Purification of PRO Polypeptides Using Specific Antibodies 

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the 
art of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO 
polypeptide is purified by immunoaffinity chromatography using antibodies specific for the PRO 

10 polypeptide of interest. In general, an immunoaffinity column is constructed by covalently coupling the 
anti-PRO polypeptide antibody to an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with 
ammonium sulfate or by purification on immobilized Pro.ein A (Pharmacia LKB Biotechnology, 
Piscataway, N.J.). Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium 

15 sulfate precipitation or chromatography on immobilized Protein A. Partially purified immunoglobulin is 
covalently attached to a chromatographic resin such as CnBr-activated SEPHAHOSE™ (Pharmacia LKB 
Biotechnology). The antibody is coupled to the resin, the resin is blocked, and the derivative resin is washed 
according to the manufacturer's instructions. 

Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a 

20 fraction from cells containing PRO polypeptide in a soluble form. This preparation is derived by 
solubilization of the whole cell or of a subcellular fraction obtained via differential centrifugation by the 
addition of detergent or by other methods well known in the art. Alternatively, soluble PRO polypeptide 
containing a signal sequence may be secreted in useful quantity into the medium in which the cells are 
grown. 

25 A soluble PRO polypeptide-containing preparation is passed over the immunoaffinity column, and 

the column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high 
ionic strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
antibody/PRO polypeptide binding (e.g., a low pH buffer such as approximately pH 2-3, or a high 
concentration of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

30 

EXAMPLE 9: Drug Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or 
binding fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or 
fragment employed in such a test may either be free in solution, affixed to a solid support, borne on a cell 
35 surface, or located intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells 
which are stably transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. 
Drugs are screened against such transformed cells in competitive binding assays. Such cells, either in viable 
or fixed form, can be used for standard binding assays. One may measure, for example, the formation of 
complexes between PRO polypeptide or a fragment and the agent being tested. Alternatively, one can 
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examine the diminution in complex formation between the PRO polypeptide and its target cell or target 
receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can 
affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent 
w,th an PRO polypeptide or fragment thereof and assaying (I) for the presence of a complex between the 
agent and the PRO polypeptide or fragment, or (ii) for the presence of a comp.ex between the PRO 
po.ypept.de or fragment and the cell, by methods well known in the art. In such competitive binding assays 
the PRO polypeptide or fragment is typically labeled. After suitable incubation, free PRO polypeptide or 
fragment is separated from that present in bound form, and the amount of free or uncomplexed label is a 
10 measure of the ability of the particular agent to bind to PRO po.ypeptide or to interfere with the PRO 
polypeptide/cell complex. 

Another technique for drug screening provides high throughput screening for compounds having 
suitable bmding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 
13 1984. Bnefly stated, large numbers of different small peptide test compounds are synthesized on a solid 
substrate, such as plastic pins or some other surface. As applied to a PRO po.ypeptide, the peptide test 
compounds a,e reacted with PRO polypeptide and washed. Bound PRO po.ypeptide is detected by methods 
well known in the art. Purified PRO polypeptide can also be coated directly onto plates for use in the 
aforementioned drug screening technique, In addition, non-neutralizing antibodies can be used to capture 
the peptide and immobilize it on the solid support. 
20 This invention also contemplates the use of competitive drug screening assays in which neutralizing 

an.bod.es capable of binding PRO polypeptide specifically compete with a test compound for binding to 
PRO polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of 
any peptide which shares one or more antigenic determinants with PRO polypeptide. 

25 EXAMPLE 10; Rational TW rw; n 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide 
of interest (/,., a PRO polypeptide) or of small molecules with which they interact, ,g., agonists 
antagonists, or inhibitors. Any of these examples can be used to fashion drugs which are more active or 
stable forms of the PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide 

iv m vivo (c.f., Hodgson, Bio/Technolo y y £: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of a PRO 
Pdypeptide-inhibitor comp.ex, is determined by x-ray crystallography, by computer modeling or, most 
typ.cal.y, by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must 
be ascertained to elucidate the structure and to determine active site(s) of the molecule. Less often, useful 
5 information regarding the structure of the PRO polypeptide may be gained by modeling based on the 
suture of homologous proteins. In both cases, relevant structural information is used to design ana.ogous 
PRO polypeptide-like molecu.es or to identify efficient inhibitors. Useful examples of rational drug design 
may include mo.ecu.es which have improved activity or stability as shown by Braxton and Wells 
M« 31 :7796-7801 (1992) or which act as inhibitors, agonists, or antagonists of native peptides as' 
sh0 wn b y Athauda et al , J, Biochem. . H3 :742-746 ( 1 993). 
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It is also possible to isolate a target-specific antibody, selected by functional assay, as described 
above, and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which 
subsequent drug design can be based. It is possible to bypass protein crystallography altogether by 
generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror 

5 image of a mirror image, the binding site of the anti-ids would be expected to be an analog of the original 
receptor. The anti-id could then be used to identify and isolate peptides from banks of chemically or 
biologically produced peptides. The isolated peptides would then act as the pharmacore. 

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made 
available to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO 

10 polypeptide amino acid sequence provided herein will provide guidance to those employing computer 
modeling techniques in place of or in addition to x-ray crystallography. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since 
the deposited embodiment is intended as a single illustration of certain aspects of the invention and any 

15 constructs that are functionally equivalent are within the scope of this invention. The deposit of material 
herein does not constitute an admission that the written description herein contained is inadequate to enable 
the practice of any aspect of the invention, including the best mode thereof, nor is it to be construed as 
limiting the scope of the claims to the specific illustrations that it represents. Indeed, various modifications 
of the invention in addition to those shown and described herein will become apparent to those skilled in the 

20 art from the foregoing description and fall within the scope of the appended claims. 



\ 
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What is claimed : 

1. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 
sequence shown in Figure 1 (SEQ ID NO:l), Figure 3 (SEQ ID NO:3), Figure 5 (SEQ ID NO:5), Figure 7 
(SEQ ID NO:7), Figure 9 (SEQ ID NO:9), Figure 11 (SEQ ID NO: 11), Figure 13A-B (SEQ ID NO: 13), 
Figure 15 (SEQ ID NO: 15), Figure 17 (SEQ ID NO: 17), Figure 19 (SEQ ID NO: 19), Figure 21 (SEQ ID 
NO:21), Figure 23 (SEQ ID NO:23), Figure 25 (SEQ ID NO:25), Figure 27 (SEQ ID NO:27), Figure 29 
(SEQ ID NO:29), Figure 31A-B (SEQ ID NO:31), Figure 33 (SEQ ID NO:33), Figure 35 (SEQ ID NO:35), 
Figure 37 (SEQ ID NO:37), Figure 39A-B (SEQ ID NO:39), Figure 41A-C (SEQ ID NO:41), Figure 43 
(SEQ ID NO:43), Figure 45 (SEQ ID NO:45), Figure 47 (SEQ ID NO:47), Figure 49 (SEQ ID NO:49), 
Figure 51 (SEQ ID NO:51), Figure 53 (SEQ ID NO:53), Figure 55 (SEQ ID NO:55), Figure 57 (SEQ ID 
NO:57), Figure 59A-B (SEQ ID NO:59), Figure 61A-B (SEQ ID NO:61) or Figure 63 (SEQ ID NO:63). 

2. Isolated nucleic acid having at least 80% nucleic acid sequence identity to a nucleotide 
sequence consisting of the full-length coding sequence of the nucleotide sequence as shown in Figure 1 
(SEQ ID NO:l), Figure 3 (SEQ ID NO:3), Figure 5 (SEQ ID NO:5), Figure 7 (SEQ ID NO:7), Figure 9 
(SEQ ID NO:9), Figure 11 (SEQ ID NO: 11), Figure 13A-B (SEQ ID NO: 13), Figure 15 (SEQ ID NO:15), 
Figure 17 (SEQ ID NO: 17), Figure 19 (SEQ ID NO: 19), Figure 21 (SEQ ID NO:21), Figure 23 (SEQ ID 
NO:23), Figure 25 (SEQ ID NO:25), Figure 27 (SEQ ID NO:27), Figure 29 (SEQ ID NO:29), Figure 31A-B 
(SEQ ID NO:31), Figure 33 (SEQ ID NO:33), Figure 35 (SEQ ID NO:35), Figure 37 (SEQ ID NO:37), 
Figure 39A-B (SEQ ID NO:39), Figure 41A-C (SEQ ID NO:41), Figure 43 (SEQ ID NO:43), Figure 45 
(SEQ ID NO:45), Figure 47 (SEQ ID NO:47), Figure 49 (SEQ ID NO:49), Figure 51 (SEQ ID NO:51), 
Figure 53 (SEQ ID NO:53), Figure 55 (SEQ ID NO:55), Figure 57 (SEQ ID NO:57), Figure 59A-B (SEQ ID 
NO:59), Figure 61A-B (SEQ ID NO:61) or Figure 63 (SEQ ID NO:63). 

3. Isolated nucleic acid consisting of the nucleotide sequence shown in Figure 1 (SEQ ID 
NO:l), Figure 3 (SEQ ID NO:3), Figure 5 (SEQ ID NO:5), Figure 7 (SEQ ID NO:7), Figure 9 (SEQ ID 
NO:9), Figure 11 (SEQ ID NO:l 1), Figure 13A-B (SEQ ID NO:13), Figure 15 (SEQ ID NO:15), Figure 17 
(SEQ ID NO: 17), Figure 19 (SEQ ID NO: 19), Figure 21 (SEQ ID NO:21), Figure 23 (SEQ ID NO:23), 
Figure 25 (SEQ ID NO:25), Figure 27 (SEQ ID NO:27), Figure 29 (SEQ ID NO:29), Figure 31A-B (SEQ ID 
NO:31), Figure 33 (SEQ ID NO:33), Figure 35 (SEQ ID NO:35), Figure 37 (SEQ ID NO:37), Figure 39A-B 
(SEQ ID NO:39), Figure 41A-C (SEQ ID NO:41), Figure 43 (SEQ ID NO:43), Figure 45 (SEQ ID NO:45), 
Figure 47 (SEQ ID NO:47), Figure 49 (SEQ ID NO:49), Figure 51 (SEQ ID NO:51), Figure 53 (SEQ ID 
NO:53), Figure 55 (SEQ ID NO:55), Figure 57 (SEQ ID NO:57), Figure 59A-B (SEQ ID NO:59), Figure 
61 A-B (SEQ ID NO:61) or Figure 63 (SEQ ID NO:63). 

4. A vector comprising the nucleic acid of Claim 1 . 
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5. The vector of Claim 4 operably linked to control sequences recognized by a host cell 
transformed with the vector. 

6. A host cell comprising the vector of Claim 4. 

5 

7. The host cell of Claim 6, wherein said cell is a CHO cell, an E.coli cell or a yeast cell. 

8. A process for producing a PRO polypeptide comprising culturing the host cell of Claim 6 
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from 

10 the cell culture. 



9. An isolated polypeptide having at least 80% amino acid sequence identity to any one of 
the amino acid sequence of the polypeptides shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:4), 
Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID NO: 10), Figure 12 (SEQ ID NO: 12), 

15 Figure 14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO: 16), Figure 18 (SEQ ID NO:18), Figure 20 (SEQ ID 
NO:20), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure 28 
(SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 34 (SEQ ID NO:34), 
Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42A-B (SEQ ID 
NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 

20 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID NO:56), 
Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62A-B (SEQ ID NO:62) or Figure 64 (SEQ 
ID NO:64). 

10. A chimeric molecule comprising a polypeptide according to Claim 9 fused to a 
25 heterologous amino acid sequence. 

11. The chimeric molecule of Claim 9, wherein said heterologous amino acid sequence is an 
epitope tag sequence or an Fc region of an immunoglobulin. 

30 12. An antibody which specifically binds to a polypeptide according to Claim 9. 

13. The antibody of Claim 12, wherein said antibody is a monoclonal antibody, a humanized 
antibody or a single-chain antibody. 

35 14. A composition of matter comprising (a) a polypeptide of Claim 9, (b) an agonist of said 

polypeptide, (c) an antagonist of said polypeptide, or (d) an antibody that binds to said polypeptide, in 
combination with a carrier. 

15. The composition of matter of Claim 14, wherein said carrier is a pharmaceutical^ 
4G acceptable carrier. 
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16. The composition of matter of Claim 1 5 comprising a therapeutically effective amount of 
(a), (b), (c) or (d). 

17. An article of manufacture, comprising: 
a container; 

a label on said container; and 

a composition of matter comprising (a) a polypeptide of Claim 9, (b) an agonist of said polypeptide, 
(c) an antagonist of said polypeptide, or (d) an antibody that binds to said polypeptide, contained within said 
container, wherein label on said container indicates that said composition of matter can be used for treating 
psoriasis. 

18. A method of identifying a coihpound that inhibits the expression of a gene encoding a 
PRO polypeptide of the invention as described in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:4), Figure 
6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID NO: 10), Figure 12 (SEQ ID NO: 12), Figure 
14 (SEQ ID NO: 14), Figure 16 (SEQ ID NO:16), Figure 18 (SEQ ID NO: 18), Figure 20 (SEQ ID NO:20), 
Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure 28 (SEQ ID 
NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 34 (SEQ ID NO:34), Figure 36 
(SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42A-B (SEQ ID NO:42), 
Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 (SEQ ID 
NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID NO:56), Figure 58 
(SEQ ID NO:58), Figure 60 (SEQ ID NO:60), Figure 62A-B (SEQ ID NO:62) or Figure 64 (SEQ ID 
NO:64), said method comprising contacting cells which normally express said polypeptide with a candidate 
compound, and determining the lack of expression said gene. 

19. The method of Claim 23, wherein said candidate compound is an antisense nucleic acid. 

20. A method of alleviating psoriasis in a mammal in need thereof comprising administering 
to said mammal a therapeutically effective amount of (a) a polypeptide of the invention as shown in Figure 8 
(SEQ ID NO:8), Figure 18 (SEQ ID NO: 18), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), 
Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 34 (SEQ ID 
NO:34), Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42A-B 
(SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), 
Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID 
NO:56), Figure 58 (SEQ ID NO:58) or Figure 60 (SEQ ID NO:60), (b) an antagonist of said polypeptide, or 
(c) an antibody that binds to said polypeptide. 

21. A method of diagnosing psoriasis in a mammal, said method comprising detecting the 
level of expression of a gene encoding a PRO polypeptide of the invention as described in Figure 8 (SEQ ID 
NO:8), Figure 18 (SEQ ID NO: 18), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure 28 

89 



WO 2004/081199 



PCT/US2004/007862 



(SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 32 (SEQ ID NO:32), Figure 34 (SEQ ID NO:34), 
Figure 36 (SEQ ID NO:36), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42A-B (SEQ ID 
NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 
(SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID NO:56) } 
5 Figure 58 (SEQ ID NO:58) or Figure 60 (SEQ ID NO:60) (a) in a test sample of tissue cells obtained from 
the mammal, and (b) in a control sample of known normal tissue cells of the same cell type, wherein a 
higher or lower level of expression of said gene in the test sample as compared to the control sample is 
indicative of the presence of psoriasis in the mammal from which the test tissue cells were obtained. 

10 22, A method of diagnosing an psoriasis in a mammal, said method comprising (a) contacting 

a PRO polypeptide of the invention as described in Figure 8 (SEQ ID NO:8), Figure 18 (SEQ ID NO:18), 
Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID 
NO:30), Figure \32 (SEQ ID NO:32), Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 38 
(SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42A-B (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), 

15 Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID 
NO:52), Figure 54 (SEQ ID NO:54), Figure 56 (SEQ ID NO:56), Figure 58 (SEQ ID NO:58) or Figure 60 
(SEQ ID NO:60) anti-PRO antibody with a test sample of tissue cells obtained from said mammal and (b) 
detecting the formation of a complex between the antibody and the polypeptide in the test sample, wherein 
formation of said complex is indicative of the presence of psoriasis in the mammal from which the test tissue 

20 cells were obtained. 

23. A method of alleviating Crohn's disease in a mammal in need thereof comprising 
administering to said mammal a therapeutically effective amount of (a) a polypeptide of die invention as 
shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:4), Figure 12 (SEQ ID NO: 12), Figure 16 (SEQ - 
25 ID NO: 16), Figure 18 (SEQ ID NO:18), Figure 20 (SEQ ID NO:20), Figure 32 (SEQ ID NO:32), Figure 38 
(SEQ ID NO:38), Figure 40 (SEQ ID NO:40), Figure 42A-B (SEQ ID NO:42), Figure 48 (SEQ ID NO:48), 
Figure 56 (SEQ ID NO:56), Figure 58 (SEQ ID NO:58), or Figure 64 (SEQ ID NO:64), (b) an antagonist of 
said polypeptide, or (c) an antibody that binds to said polypeptide. 

30 24. A method of diagnosing Crohn's disease in a mammal, said method comprising detecting 

the level of expression of a gene encoding a PRO polypeptide of the invention as described in Figure 2 
(SEQ ID NO:2), Figure 4 (SEQ ID NO:4), Figure 12 (SEQ ID NO: 12), Figure 16 (SEQ ID NO:16), Figure 
18 (SEQ ID NO: 18), Figure 20 (SEQ ID NO:20), Figure 32 (SEQ ID NO:32), Figure 38 (SEQ ID NO:38), 
Figure 40 (SEQ ID NO:40), Figure 42A-B (SEQ ID NO:42), Figure 48 (SEQ ID NO:48), Figure 56 (SEQ ID 

35 NO:56), Figure 58 (SEQ ID NO:58), or Figure 64 (SEQ ID NO:64) (a) in a test sample of tissue cells 
obtained from the mammal, and (b) in a control sample of known normal tissue cells of the same cell type, 
wherein a higher or lower level of expression of said gene in the test sample as compared to the control 
sample is indicative of the presence of Crohn's disease in the mammal from which the test tissue cells were 
obtained. 

40 
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25. A method of diagnosing Crohn's disease in a mammal, said method comprising (a) 
contacting a PRO polypeptide of the invention as described in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID 
NO:4), Figure 12 (SEQ ID NO:12), Figure 16 (SEQ ID NO: 16), Figure 18 (SEQ ID NO: 18), Figure 20 
(SEQ ED NO:20), Figure 32 (SEQ ID NO:32), Figure 38 (SEQ ID NO:38), Figure 40 (SEQ ID NO:40), 

5 Figure 42A-B (SEQ ID NO:42), Figure 48 (SEQ ID NO:48), Figure 56 (SEQ ID NO:56), Figure 58 (SEQ ID 
NO:58), or Figure 64 (SEQ ID NO:64) anti-PRO antibody with a test sample of tissue cells obtained from 
said mammal and (b) detecting the formation of a complex betv/een the antibody and the polypeptide in the 
test sample, wherein formation of said complex is indicative of the presence of Crohn's disease in the 
mammal from which the test tissue cells were obtained. 

10 

26. A method of alleviating Ulcerative Colitis in a mammal in need thereof comprising 
administering to said mammal a therapeutically effective amount of (a) a polypeptide of the invention as 
shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ED NC:6), Figure 8 (SEQ ID 
NO:8), Figure 10 (SEQ ID NO: 10), Figure 16 (SEQ ID NO: 16), Figure 22 (SEQ ID NO:22), Figure 24 

15 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), 

Figure 42A-B (SEQ ID NO:42), Figure 48 (SEQ ID NO:48), Figure 52 (SEQ ID NO:52), or Figure 64 (SEQ 
ID NO:64), (b) an antagonist of said polypeptide, or (c) an antibody that binds to said polypeptide. 

27. A method of diagnosing Ulcerative Colitis in a mammal, said method comprising 
20 detecting the level of expression of a gene encoding a PRO polypeptide of the invention as described in 

Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), 
Figure 10 (SEQ ID NO: 10), Figure 16 (SEQ ID NO: 16), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID 
NO:24), Figure 26 (SEQ ID NO:26), Figure 28 (SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 42A-B 
(SEQ ID NO:42), Figure 48 (SEQ ID NO:48), Figure 52 (SEQ ID NO:52), or Figure 64 (SEQ ID NO:64), 
25 (a) in a test sample of tissue cells obtained from the mammal, and (b) in a control sample of known normal 
tissue cells of the same cell type, wherein a higher or lower level of expression of said gene in the test 
sample as compared to the control sample is indicative of the presence of Ulcerative Colitis in the mammal 
from which the test tissue cells were obtained. 

30 28. A method of diagnosing Ulcerative Colitis in a mammal, said method comprising (a) 

contacting a PRO polypeptide of the invention as described in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID 
NO:4), Figure 6 (SEQ ID NO:6), Figure 8 (SEQ ID NO:8), Figure 10 (SEQ ID NO: 10), Figure 16 (SEQ ID 
NO: 16), Figure 22 (SEQ ID NO:22), Figure 24 (SEQ ID NO:24), Figure 26 (SEQ ID NO:26), Figure 28 
(SEQ ID NO:28), Figure 30 (SEQ ID NO:30), Figure 42A-B (SEQ ID NO:42), Figure 48 (SEQ ID NO:48), 

35 Figure 52 (SEQ ID NO:52), or Figure 64 (SEQ ID NO:64), anti-PRO antibody with a test sample of tissue 
cells obtained from said mammal and (b) detecting the formation of a complex between the antibody and the 
polypeptide in the test sample, wherein formation of said complex is indicative of the presence of Ulcerative 
Colitis in the mammal from which the test tissue cells were obtained. 
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29. A method of alleviating rheumatoid arthritis in a mammal in need thereof comprising 
administering to said mammal a therapeutically effective amount of (a) a polypeptide of the invention as 
shown in Figure 10 (SEQ ID NO: 10), Figure 14 (SEQ ID NO: 14), Figure 18 (SEQ ID NO: 18), Figure 34 
(SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 40 (SEQ ID NO:40), Figure 42A-B (SEQ ID NO:42), 
5 Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 (SEQ ID 
NO:50), Figure 52 (SEQ ID NO:52), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), or Figure 
62A-B (SEQ ID NO:62) (b) an antagonist of said polypeptide, or (c) an antibody that binds to said 
polypeptide. 

10 30. A method of diagnosing rheumatoid arthritis in a mammal, said method comprising 

detecting the level, of expression of a gene encoding a PRO polypeptide of the invention as described in 
Figure 10 (SEQ ID NO: 10), Figure 14 (SEQ ID NO:14), Figure 18 (SEQ ID NO: 18), Figure 34 (SEQ ID 
NO:34), Figure 36 (SEQ ID NO:36), Figure 40 (SEQ ID NO:40), Figure 42A-B (SEQ ID NO:42), Figure 44 
(SEQ ID NO:44), Figure 46 (SEQ ID NO:46), Figure 48 (SEQ ID NO:48), Figure 50 (SEQ ID NO:50), 

15 Figure 52 (SEQ ID NO:52), Figure 58 (SEQ ID NO:58), Figure 60 (SEQ ID NO:60), or Figure 62A-B (SEQ 
ID NO:62) (a) in a test sample of tissue cells obtained from the mammal, and (b) in a control sample of 
known normal tissue cells of the same cell type, wherein a higher or lower level of expression of said gene in 
the test sample as compared to the control sample is indicative of the presence of rheumatoid arthritis in the 
mammal from which the test tissue cells were obtained. 

20 

31. A method of diagnosing rheumatoid arthritis in a mammal, said method comprising (a) 
contacting a PRO polypeptide of the invention as described in Figure 10 (SEQ ID NO: 10), Figure 14 (SEQ 
ID NO:14) 5 Figure 18 (SEQ ID NO: 18), Figure 34 (SEQ ID NO:34), Figure 36 (SEQ ID NO:36), Figure 40 
(SEQ ID NO:40), Figure 42A-B (SEQ ID NO:42), Figure 44 (SEQ ID NO:44), Figure 46 (SEQ ID NO:46), 
25 Figure 48 (SEQ ID NO:48), Figure 50 (SEQ ID NO:50), Figure 52 (SEQ ID NO:52), Figure 58 (SEQ ID 
NO:58), Figure 60 (SEQ ID NO:60), or Figure 62A-B (SEQ ID NO:62) anti-PRO antibody with a test 
sample of tissue cells obtained from said mammal and (b) detecting the formation of a complex between the 
antibody and the polypeptide in the test sample, wherein formation of said complex is indicative of the 
presence of rheumatoid arthritis in the mammal from which the test tissue cells were obtained. 

30 
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FIGURE 1 



GGGACGGCGACAGCGGGTCGGCGGGCCGCAGGAGGGGGTC&TGGGTAAAGACTACTACCA 

GACGTTGGGCCTGGCCCGCGGCGCGTCGGACGAGGAGATCAAGCGGGCCTACCGCCGCCA 

GGCGCTGCGCTACCACCCGGACAAGAACAAGGAGCCCGGCGCCGAGGAGAAGTTCAAGGA 

GATCGCTGAGGCCTACGACGTGCTCAGCGACCCGCGCAAGCGCGAGATCTTCGACCGCTA 

CGGGGAGGAAGGCCTAAAGGGGAGTGGCCCCAGTGGCGGTAGCGGCGGTGGTGCCAATGG 

TACCTCTTTCAGCTACACATTCCATGGAGACCCTCATGCCATGTTTGCTGAGTTCTTCGG 

TGGCAGAAATCCCTTTGACACCTTTTTTGGGCAGCGGAACGGGGAGGAAGGCATGGACAT 

TGATGACCCATTCTCTGGCTTCCCTATGGGCATGGGTGGCTTCACCAACGTGAACTTTGG 

CCGCTCCCGCTCTGCCCAAGAGCCCGCCCGAAAGAAGCAAGATCCCCCAGTCACCCACGA 

CCTTCGAGTCTCCCTTGAAGAGATCTACAGCGGCTGTACCAAGAAGATGAAAATCTCCCA 

CAAGCGGCTAAACCCCGACGGAAAGAGCATTCGAAACGAAGACAAAATATTGACCATCGA 

AGTGAAGAAGGGGTGGAAAGAAGGAACCAAAATCACTTTCCCCAAGGAAGGAGACCAGAC 

CTCCAACAACATTCCAGCTGATATCGTCTTTGTTTTAAAGGACAAGCCCCACAATATCTT 

TAAGAGAGATGGCTCTGATGTCATTTATCCTGCCAGGATCAGCCTCCGGGAGGCTCTGTG 

TGGCTGCACAGTGAACGTCCCCACTCTGGACGGCAGGACGATACCCGTCGTATTCAAAGA 

TGTTATCAGGCCTGGCATGCGGCGAAAAGTTCCTGGAGAAGGCCTCCCCCTCCCCAAAAC 

ACCCGAGAAACGTGGGGACCTCATTATTGAGTTTGAAGTGATCTTCCCCGAAAGGATTCC 

CCAGACATCAAGAACCGTACTTGAGCAGGTTCTTCCAATATAGCTATCTGAGCTCCCCAA 

GGACTGACCAGGGACCTTTCCAGAGCTCAAGGATTTCTGGACCTTTCTACCAGTTGTGGA 

CCATGAGAGGGTGGGAGGGCCCAGGGAGGGCTTTCGTACTGCTGAATGTTTTCCAGAGCA 

TATATTACAATCTTTCAAAGTCGCACACTAGACTTCAGTGGTTTTTCGAGCTATAGGGCA 

TCAGGTGGTGGGAACAGCAGGAAAAGGCATTCCAGTCTGCCCCACTGGGTCTGGCAGCCC 

TCCCGGGATGGGCCCACATCCACCTCCAGTCCCTGGCCAGGGGTGAGAGGCAGACCAGCA 

GATGGACTTGATCCCTCTGTGTCTTTGGGCTTCTGGCTGGTAGATAATGTCAACCTGCAG 

TCTTGATTCCCAGACCCTGTACACTCCTCCTTTTCTGTTGTGTGATCAGTTTGTGCTTTA 

TTCTGTATTTGTCTCCCATGTCTTGCTCTTCTCCTGGAGAATTCTGTCTTCTCTTTGGCC 

ATCTCAAATTGAGAACCTAAACTATTCCTGCAGAACTGCCTGGTTGGCGTCCACAAGCAA 

TACCTCTCGTTCCAGCAGGACCAAGGGAGCCAGCCTCCAGTGAGTGACTCCAGCAAGTGC 

AGCCACCTCTCCCTTGATGGTCTGGGAGCCTGGCCTCAGCAAGGGGCCTTCCTGACCTCT 

GGCTCCAGTGAAGCTGAATGTCCTCACTTTGTGGGTCACACTCTTTACATTTCTGTAAGG 

CAATCTTGGCACACGTGGGGCTTACCAGTGGCCCAGGTAATTTTTTGTTTCATGGACTAT 

GGACTCTTTCAAAGGGATCTGATCCTTTTGAATTTTGCACAGCCCTAGATACAATCCCTT 

TTGATAAAAGGGTCTTTGCTTCTGATTACAGGAGCACTGTGGAACGTCTGTAAATATGTT 

TTTATAATTCCATGTATAGTTGGTGTACACTCAAAACCTGTCCCCGGCAGCCAGTGCTCT 

CTGTATAGGGCCATAATGGAATTCTGAAGAAATCTTGGGGAGGGAAGGGGAGTTGGAACA 

AATGTCTGTTCCCTGGAGGCCAGTCCAGTGCTCAGACCTTTAGACTCATTGTAAGTTGCC 

ACTGCCAACATGAGACCAAAGTGTGTGACTAGTCAATGAAGTGCGACAGCATTAAAGACT 
GATGCTAAACCTC 
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FIGURE 2 



MGKDYYQTIiGLARGASDEEIKRAYRRQALRYHPDKNKEPGAEEKFKEIAEAYDVLSDPRK 
REIFDRYGEEGLKGSGPSGGSGGGANGTSFSYTFHGDPHAMFAEFFGGRNPFDTFFGQRN 
GEEGMDIDDPFSGFPMGMGGFTNVNFGRSRSAQEPARKKQDPPVTHDLRVSLEEIYSGCT 
KKMKISHKRLNPDGKSIRNEDKILTIEVKKGWKEGTKITFPKEGDQTSNNIPADIVFVLK 
DKPHNIFKRDGSDVIYPARISLREALCGCTVNVPTLDGRTIPVVFKDVIRPGMRRKVPGE 
GLPLPKTPEKRGDLIIEFEVIFPERIPQTSRTVLEQVLPI 



signal sequence 
none 

Transmembrane domain 
none 

N-glycosylation site. 

86- 89 

Glycosaminoglycan attachment site. 

81- 84 

Tyrosine kinase phosphorylation site. 
169-176 
249-255 

N-myristoylation site. 
10-15 
71-76 
74-79 

79- 84 

80- 85 

82- 87 

83- 88 

84- 89 

87- 92 
117-122 
139-144 

215- 220 
268-273 

Cell attachment sequence. 
311-313 

Nt-dnaJ domain signature. 
45-65 

DnaJ domain 
4-68 

DnaJ C terminal region 

216- 338 
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FIGURE 3 



AGAGTTGCACTGAGTGTGGCTGAAGCAGCGAGGCGGGAGTGGAGGTGCGCGGAGTCAGGC 
AGACAGACAGACACAGCCAGCCAGCCAGGTCGGCAGTATAGTCCGAACTGCAAATCTTAT 
TTTCTTTTCACCTTCTCTCTAACTGCCCAGAGCTAGCGCCTGTGGCTCCCGGGCTGGTGG 
TTCGGGAGTGTCCAGAGAGCCTTGTCTCCAGCCGGCCCCGGGAGGAGAGCCCTGCTGCCC 
AGGCGCTGTTGACAGCGGCGGAAAGCAGCGGTACCCCACGCGCCCGCCGGGGGACGTCGG 
CGAGCGGCTGCAGCAGCAAAGAACTTTCCCGGCGGGGAGGACCGGAGACAAGTGGCAGAG 
TCCCGGAGCGAACTTTTGCAAGCCTTTCCTGCGTCTTAGGCTTCTCCACGGCGGTAAAGA 
CCAGAAGGCGGCGGAGAGCCACGCAAGAGAAGAAGGACGTGCGCTCAGCTTCGCTCGCAC 
CGGTTGTTGAACTTGGGCGAGCGCGAGCCGCGGCTGCCGGGCGCCCCCTCCCCCTAGCAG 
CGGAGGAGGGGACAAGTCGTCGGAGTCCGGGCGGCCAAGACCCGCCGCCGGCCGGCCACT 
GCAGGGTCCGCACTGATCCGCTCCGCGGGGAGAGCCGCTGCTCTGGGAAGTGAGTTCGCC 
\ TGCGGACTCCGAGGAACCGCTGCGCCCGAAGAGCGCTCAGTGAGTGACCGCGACTTTTCA 
AAGCCGGGTAGCGCGCGCGAGTCGACAAGTAAGAGTGCGGGAGGCATCTTAATTAACCCT 
GCGCTCCCTGGAGCGAGCTGGTGAGGAGGGCGCAGCGGGGACGACAGCCAGCGGGTGCGT 
GCGCTCTTAGAGAAACTTTCCCTGTCAAAGGCTCCGGGGGGCGCGGGTGTCCCCCGCTTG 
CCAGAGCCCTGTTGCGGCCCCGAAACTTGTGCGCGCACGCCAAACT/LACCTCACGTGAAG 
TGACGGACTGTTCTATGACTGCAAAGATGGAAACGACCTTCTATGACGATGCCCTCAACG 
CCTCGTTCCTCCCGTCCGAGAGCGGACCTTATGGCTACAGTAACCCCAAGATCCTGAAAC 
AGAGCATGACCCTGAACCTGGCCGACCCAGTGGGGAGCCTGAAGCCGCACCTCCGCGCCA 
AGAACTCGGACCTCCTCACCTCGCCCGACGTGGGGCTGCTCAAGCTGGCGTCGCCCGAGC 
TGGAGCGCCTGATAATCCAGTCCAGCAACGGGCACATCACCACCACGCCGACCCCCACCC 
AGTTCCTGTGCCCCAAGAACGTGACAGATGAGCAGGAGGGGTTCGCCGAGGGCTTCGTGC 
GCGCCCTGGCCGAACTGCACAGCCAGAACACGCTGCCCAGCGTCACGTCGGCGGCGCAGC 
CGGTCAACGGGGCAGGCATGGTGGCTCCCGCGGTAGCCTCGGTGGCAGGGGGCAGCGGCA 
GCGGCGGCTTCAGCGCCAGCCTGCACAGCGAGCCGCCGGTCTACGCAAACCTCAGCAACT 
TCAACCCAGGCGCGCTGAGCAGCGGCGGCGGGGCGCCCTCCTACGGCGCGGCCGGCCTGG 
CCTTTCCCGCGCAACCCCAGCAGCAGCAGCAGCCGCCGCACCACCTGCCCCAGCAGATGC 
CCGTGCAGCACCCGCGGCTGCAGGCCCTGAAGGAGGAGCCTCAGACAGTGCCCGAGATGC 
CCGGCGAGACACCGCCCCTGTCCCCCATCGACATGGAGTCCCAGGAGCGGATCAAGGCGG 
AGAGGAAGCGCATGAGGAACCGCATCGCTGCCTCCAAGTGCCGAAAAAGGAAGCTGGAGA 
GAATCGCCCGGCTGGAGGAAAAAGTGAAAACCTTGAAAGCTCAGAACTCGGAGCTGGCGT 
CCACGGCCAACATGCTCAGGGAACAGGTGGCACAGCTTAAACAGAAAGTCATGAACCACG 
TTAACAGTGGGTGCCAACTCATGCTAACGCAGCAGTTGCAAACATTTTGAAGAGAGACCG 
' TCGGGGGCTGAGGGGCAACGAAGAAAAAAAATAACACAGAGAGACAGACTTGAGAACTTG 
ACAAGTTGCGACGGAGAGAAAAAAGAAGTGTCCGAGAACTAAAGCCAAGGGTATCCAAGT 
TGGACTGGGTTCGGTCTGACGGCGCCCCCAGTGTGCACGAGTGGGAAGGACTTGGTCGCG 
CCCTCCCTTGGCGTGGAGCCAGGGAGCGGCCGCCTGCGGGCTGCCCCGCTTTGCGGACGG 
GCTGTCCCCGCGCGAACGGAACGTTGGACTTTCGTTAACATTGACCAAGAACTGCATGGA 
CCTAACATTCGATCTCATTCAGTATTAAAGGGGGGAGGGGGAGGGGGTTACAAACTGCAA 
TAGAGACTGTAGATTGCTTCTGTAGTACTCCTTAAGAACACAAAGCGGGGGGAGGGTTGG 
GGAGGGGCGGCAGGAGGGAGGTTTGTGAGAGCGAGGCTGAGCCTACAGATGAACTCTTTC 
TGGCCTGCTTTCGTTAACTGTGTATGTACATATATATATTTTTTAATTTGATTAAAGCTG 
ATTACTGTCAATAAACAGCTTCATGCCTTTGTAAGTTATTTCTTGTTTGTTTGTTTGGGT 
ATCCTGCCCAGTGTTGTTTGTAAATAAGAGATTTGGAGCACTCTGAGTTTACCATTTGTA 
ATAAAGTATATAATTTTTTTATGTTTTGTTTCTGAAAATTCCAGAAAGGATATTTAAGAA 
AATACAATAAACTATTGGAAAGTACTCCCCTAACCTCTTTTCTGCATCATCTGTAGATCC 
TAGTCTATCTAGGTGGAGTTGAAAGAGTTAAGAATGCTCGATAAAATCACTCTCAGTGCT 
TCTTACTATTAAGCAGTAAAAACTGTTCTCTATTAGACTTAGAAATAAATGTACCTGATG 
TACCTGATGCTATGTCAGGCTTCATACTCCACGCTCCCCCAGCGTATCTATATGGAATTG 
CTTACCAAAGGCTAGTGCGATGTTTCAGGAGGCTGGAGGAAGGGGGGTTGCAGTGGAGAG 
GGACAGCCCACTGAGAAGTCAAACATTTCAAAGTTTGGATTGCATCAAGTGGCATGTGCT 
GTGACCATTTATAATGTTAGAAATTTTACAATAGGTGCTTATTCTCAAAGCAGGAATTGG 
TGGCAGATTTTACAAAAGATGTATCCTTCCAATTTGGAATCTTCTCTTTGACAATTCCTA 
GATAAAAAGATGGCCTTTGTCTTATGAATATTTATAACAGCATTCTGTCACAATAAATGT 
ATTCAAATAC CAAT 
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FIGURE 4 



MTAKMETTPYDDALNASFLPSESGPYGYSNPKILKQSMTLNLADPVGSLKPHLRAKNSDL 
LTS PDVGLLKLAS PELERLI I QS SNGHITTTPTPTQFLCPKNVTDEQEGFAEGFVRALAE 
LHSQNTLPSVTSAAQPWGAGMVAPAVASVAGGSGSGGFSASLHSEPPVYANLSNFNPGA 
LSSGGGAPSYGAAGLAFPAQPQQQQQPPHHLPQQMPVQHPRLQALKEEPQTVPEMPGETP 
PLSPIDMESQERIKAERKRMRNRIAASKCRKRKLERIARLEEKVKTLKAQNSEIiASTANM 
LREQVAQLKQKVMNHVNS GCQLMLTQQLQTF 



signal sequence 
none 

Transmembrane domain 
none 

N-glycosylation site. 
15-18 
102-105 
172-175 

Glycosaminoglycan attachment site . 
154-157 
183-186 



N-myristoylation site. 

152- 157 

153- 158 
157-162 
179-184 
185-190 

Leucine zipper pattern. 
280-301 
287-308 

bZIP transcription factors basic domain signature. 
257-272 



Jun-like transcription factor 
5-241 

bZIP transcription factor 
250-314 
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FIGURE 5 



CTCGGCTCACCATGTGTCACTCTCGCAGCTGCCACCCGACCATGACCATCCTGCAGGCCC 

CGACCCCGGCCCCCTCCACCATCCCGGGACCCCGGCGGGGCTCCGGTCCTGAGATCTTCA 

CCTTCGACCCTCTCCCGGAGCCCGCAGCGGCCCCTGCCGGGCGCCCCAGCGCCTCTCGCG 

GGCACCGAAAGCGCAGCCGCAGGGTTCTCTACCCTCGAGTGGTCCGGCGCCAGCTGCCAG 

TCGAGGAACCGAACCCAGCCAAAAGGCTTCTCTTTCTGCTGCTCACCATCGTCTTCTGCC 

AGATCCTGATGGCTGAAGAGGGTGTGCCGGCGCCCCTGCCTCCAGAGGACGCCCCTAACG 

CCGCATCCCTGGCGCCCACCCCTGTGTCCCCCGTCCTCGAGCCCTTTAATCTGACTTCGG 

AGCCCTCGGACTACGCTCTGGACCTCAGCACTTTCCTCCAGCAACACCCGGCCGCCTTCT 

MCTGTGACTCCCCGC^CTCCCCAAAAAGAATCCGAAAAACCACAAAGAAACACCAGGCG 

TACCTGGTGCGCGAGAGCGTATCCCCAACTGGGACTTCCGAGGCAACTTGAACTCAGAAC 

ACTACAGCGGAGACGCCACCCGGTGCTTGAGGCGGGACCGAGGCGCACAGAGACCGAGGC 

GCATAGAGACCGAGGCACAGCCCAv^CTGGGGCTAGGCCCGGTGGGAAGGAGAGCGTCGTT 

AATTTATTTCTTATTGCTCCTAATTAATATTTATATGTATTTATGTACGTCCTCCTAGGT 

GATGGAGATGTGTACGTAATATTTATTTTAACTTATGCAAGGGTGTGAGATGTTCCCCCT 

GCTGTAAATGCAGGTCTCTTGGTATTTATTGAGCTTTGTGGGACTGGTGGAAGCAGGACA 

CCTGGAACTGCGGCAAAGTAGGAGAAGAAATGGGGAGGACTCGGGTGGGGGAGGACGTCC 

CGGCTGGGATGAAGTCTGGTGGTGGGTCGTAAGTTTAGGAGGTGACTGCATCCTCCAGCA 

TCTCAACTCCGTCTGTCTACTGTGTGAGACTTCGGCGGACCATTAGGAATGAGATCCGTG 

AGATCCTTCCATCTTCTTGAAGTCGCCTTTAGGGTGGCTACGAGGTAGAGGGGTGGGGGT 

TGGTGGGCTGTCACGGAGCGACTGTCGAGATCGCCTAGTATGTTCTGTGAACACAAATAA 
AATTGATTTACTGTCTGCTAAAAAAAAAAA 
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MGUME 6 



MCHSRSCHPTMTILQAPTPAPSTIPGPRRGSGPEIFTFDPLPEPAAAPAGRPSASRGHRK 
RSRRVLYPRWRRQLPVEEPNPAKRLLFLLLTIVFCQILMAEEGVPAPLPPEDAPNAASL 
APTPVSPVLEPFNLTSEPSDYALDLSTFLQQHPAAF 



Signal sequence 
none 

Transmembrane domain 
78-98 

N-glycosylation site. 



133-136 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 



28-31 
59-62 
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FIGURE 7 



AGCTGAGGTGTGAGCAGCTGCCGAAGTCAGTTCCTTGTGGAGCCGGAGCTGGGCGCGGAT 

TCGCCGAGGCACCGAGGCACTCAGAGGAGGTGAGAGAGCGGCGGCAGACAACAGGGGACC 

CCGGGCCGGCGGCCCAGAGCCGAGCCAAGCGTGCCCGCGTGTGTCCCTGCGTGTCCGCGA 

GGATGCGTGTTCGCGGGTGTGTGCTGCGTTCACAGGTGTTTCTGCGGCAGGCGCCATGTC 

AGAACCGGCTGGGGATGTCCGTCAGAACCCATGCGGCAGCAAGGCCTGCCGCCGCCTCTT 

CGGCCCAGTGGACAGCGAGCAGCTGAGCCGCGACTGTGATGCGCTAATGGCGGGCTGCAT 

CCAGGAGGCCCGTGAGCGATGGAACTTCGACTTTGTCACCGAGACACCACTGGAGGGTGA 

CTTCGCCTGGGAGCGTGTGCGGGGCCTTGGCCTGCCCAAGCTCTACCTTCCCACGGGGCC 

CCGGCGAGGCCGGGATGAGTTGGGAGGAGGCAGGCGGCCTGGCACCTCACCTGCTCTGCT 

GCAGGGGACAGCAGAGGAAGACCATGTGGACCTGTCACTGTCTTGTACCCTTGTGCCTCG 

CTCAGGGGAGCAGGCTGAAGGGTCCCCAGGTGGACCTGGAGACTCTCAGGGTCGAAAACG 

GCGGCAGACCAGCATGACAGATTTCTACCACTCCAAACGCCGGCTGATCTTCTCCAApAG 

GAAGCCCTAATCCGCCCACAGGAAGCCTGCAGTCCTGGAAGCGCGAGGGCCTCAAAGGCC 

CGCTCTACATCTTCTGCCTTAGTCTCAGTTTGTGTGTCTTAATTATTATTTGTGTTTTAA 

TTTAAACACCTCCTCATGTACATACCCTGGCCGCCCCCTGCCCCCCAGCCTCTGGCATTA 

GAATTATTTAAACAAAAACTAGGCGGTTGAATGAGAGGTTCCTAAGAGTGCTGGGCATTT 

TTATTTTATGAAATACTATTTAAAGCCTCCTCATCCCGTGTTCTCCTTTTCCTCTCTCCC 

GGAGGTTGGGTGGGCCGGCTTCATGCCAGCTACTTCCTCCTCCCCACTTGTCCGCTGGGT 

GGTACCCTCTGGAGGGGTGTGGCTCCTTCCCATCGCTGTCACAGGCGGTTATGAAATTCA 

CCCCCTTTCCTGGACACTCAGACCTGAATTCTTTTTCATTTGAGAAGTAAACAGATGGCA 

CTTTGAAGGGGCCTCACCGAGTGGGGGCATCATCAAAAACTTTGGAGTCCCCTCACCTCC 

TCTAAGGTTGGGCAGGGTGACCCTGAAGTGAGCACAGCCTAGGGCTGAGCTGGGGACCTG 

GTACCCTCCTGGCTCTTGATACCCCCCTCTGTCTTGTGAAGGCAGGGGGAAGGTGGGGTC 

CTGGAGCAGACCACCCCGCCTGCCCTCATGGCCCCTCTGACCTGCACTGGGGAGCCCGTC 

TCAGTGTTGAGCCTTTTCCCTCTTTGGCTCCCCTGTACCTTTTGAGGAGCCCCAGCTACC 

CTTCTTCTCCAGCTGGGCTCTGCAATTCCCCTCTGCTGCTGTCCCTCCCCCTTGTCCTTT 

CCCTTCAGTACCCTCTCAGCTCCAGGTGGCTCTGAGGTGCCTGTCCCACCCCCACCCCCA 

GCTCAATGGACTGGAAGGGGAAGGGACACACAAGAAGAAGGGCACCCTAGTTCTACCTCA 

GGCAGCTCAAGCAGCGACCGCCCCCTCCTCTAGCTGTGGGGGTGAGGGTCCCATGTGGTG 

GCACAGGCCCCCTTGAGTGGGGTTATCTCTGTGTTAGGGGTATATGATGGGGGAGTAGAT 

CTTTCTAGGAGGGAGACACTGGCCCCTCAAATCGTCCAGCGACCTTCCTCATCCACCCCA 

TCCCTCCCCAGTTCATTGCACTTTGATTAGCAGCGGAACAAGGAGTCAGACATTTTAAGA 

TGGTGGCAGTAGAGGCTATGGACAGGGCATGCCACGTGGGCTCATATGGGGCTGGGAGTA 

GTTGTCTTTCCTGGCACTAACGTTGAGCCCCTGGAGGCACTGAAGTGCTTAGTGTACTTG 

GAGTATTGGGGTCTGACCCCAAACACCTTCCAGCTCCTGTAACATACTGGCCTGGACTGT 

TTTCTCTCGGCTCCCCATGTGTCCTGGTTCCCGTTTCTCCACCTAGACTGTAAACCTCTC 

GAGGGCAGGGACCACACCCTGTACTGTTCTGTGTCTTTCACAGCTCCTCCCACAATGCTG 

AATATACAGCAGGTGCTCAATAAATGATTCTTAGTGACTTTA2VAAAAAAAAAAAAAAAAA 

A 
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FIGURES 



MSEPAGDVRQNPCGSKACRRLFGPVDSEQLSRDCDALMAGCIQEARERWNFDFVTETPLE 
GDFAWERVRGLGLPKLYLPTGPRRGRDELGGGRRPGTSPALLQGTAEEDHVDLSLSCTLV 
PRSGEQAEGSPGGPGDSQGRKRRQTSMTDFYHSKRRLIFSKRKP 

signal sequence 
none 

Transmembrane domain 
none 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 
142-145 \ 

N-myristoylation site. 
14-19 
96-101 

Amidation site. 
91-96 
138-143 

Cyclin-dependent kinase inhibitor 
15-114 
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FIGURE 9 



CAGCTCTCTGTCAGi^GGCCACCATGGTACCATCCGTGTTGTGGCCCAGGGCCTGCTGG 
ACTCTGCTGGTCTGCTGTCTGCTGACCCCAGGTGTC 

GTGGAGCCCGAGAACCCTGTGCTCTCTGCTGGAGGGTCCCTGTTTGTGAACTGCAGTACT 

GATTGTCCCAGCTCTGAGAAAATCGCCTTGGAGACGTCCCTATCAAAGGAGCTGGTGGCC 

AGTGGCATGGGCTGGGCAGCCTTCAATCTCAGCAACGTGACTGGCAACAGTCGGATCCTC 

TGCTCAGTGTACTGCAATGGCTCCCAGATAACAGGCTCCTCTAACATCACCGTGTACGGG 

CTCCCGGAGCGTGTGGAGCTGGCACCCCTGCCTCCTTGGCAGCCGGTGGGCCAGAACTTC 

ACCCTGCGCTGCCAAGTGGAGGGTGGGTCGCCCCGGACCAGCCTCACGGTGGTGCTGCTT 

CGCTGGGAGGAGGAGCTGAGCCGGCAGCCCGCAGTGGAGGAGCCAGCGGAGGTCACTGCC 

ACTGTGCTGGCCAGCAGAGACGACCACGGAGCCCCTTTCTCATGCCGCACAGAACTGGAC 

ATGCAGCCCCAGGGGCTGGGACTGTTCGTGAACACCTCAGCCCCCCGCCAGCTCCGAACC ' 

^n^^™ CCCGTGACCCCCCCGCGCCTCGTGGCCCCCCGG ^CTTGGAGGTC 

^S^ GGCCGGTGGACTGCACCCTAGACGGGC TTTTTCCAGCCTCAGAGGCCCAGGTCTAC 

CTGGCGCTGGGGGACCAGATGCTGAATGCGACAGTCATGAACCACGGGGACACGCTAACG 

GCCACAGCCACAGCCACGGCGCGCGCGGATCAGGAGGGTGCCCGGGAGATCGTCTGCAAC 

GTGACCCTAGGGGGCGAGAGACGGGAGGCCCGGGAGAACTTGACGGTCTTTAGCTTCCTA 

GGACCCATTGTGAACCTCAGCGAGCCCACCGCCCATGAGGGGTCCACAGTGACCGTGAGT 

TGCATGGCTGGGGCTCGAGTCCAGGTCACGCTGGACGGAGTTCCGGCCGCGGCCCCGGGG 

CAGCCAGCTCAACTTCAGCTAAATGCTACCGAGAGTGACGACGGACGCAGCTTCTTCTGC 

AGTGCCACTCTCGAGGTGGACGGCGAGTTCTTGCACAGGAACAGTAGCGTCCAGCTGCGA 

GTCCTGTATGGTCCCAAAATTGACCGAGCCACATGCCCCCAGCACTTGAAATGGAAAGAT 

AAAACGAGACACGTCCTGCAGTGCCAAGCCAGGGGCAACCCGTACCCCGAGCTGCGGTGT 

TTGAAGGAAGGCTCCAGCCGGGAGGTGCCGGTGGGGATCCCGTTCTTCGTCAACGTAACA 

CATAATGGTACTTATCAGTGCCAAGCGTCCAGCTCACGAGGCAAATACACCCTGGTCGTG 

GTGATGGACATTGAGGCTGGGAGCTCCCACTTTGTCCCCGTCTTCGTGGCGGTGTTACTG 

ACCCTGGGCGTGGTGACTATCGTACTGGCCTTAATGTACGTCTTCAGGGAGCACCAACGG 

AGCGGCAGTTACCATGTTAGGGAGGAGAGCACCTATCTGCCCCTCACGTCTATGCAGCCG 

ACAGAAGCAATGGGGGAAGAACCGTCCAGAGCTGAGTGACGCTGGGATCCGGGATCAAAG 

TTGGCGGGGGCTTGGCTGTGCCCTCAGATTCCGCACCAATAAAGCCTTCAAACTCCCTAA 

AAAAAAAAAAAAAAAAAAAAAAAA - - - 
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FIGURE 10 



MATMVPSVLWPRACWTLLVCCLLTPGVQGQEFLLRVEPQNPVLSAGGSLFVNCSTDCPSS 
EKIALETSLSKELVASGMGWAAFNLSNVTGNSRILCSVYCNGSQITGSSNITVYGLPERV 
ELAPLPPWQPVGQNFTLRCQVEGGSPRTSLTVVLLRWEEELSRQPAVEEPAEVTATVLAS 
RDDHGAPFSCRTELDMQPQGLGLFVNTSAPRQLRTFVLPVTPPRLVAPRFLEVETSWPVD 
CTLDGLFPASEAQVYLALGDQMLNATVMNHGDTLTATATATARADQEGAREIVOSrVTLGG 
ERREARENLTVFSFLGPIVNLSEPTAHEGSTVTVSCMAGARVQVTLDGVPAAAPGQPAQL 
QLNATESDDGRSFFCSATLEVDGEFLHRNSSVQLRVLYGPKIDRATCPQHLKWKDKTRHV 
LQCQARGNPYPELRCLKEGSSREVPVGIPFFVNVTHNGTYQCQASSSRGK^'TLVVVMDIE 
AGSSHFVPVFVAVLLTLGWTIVLALMYVFREHQRSGSYHVREESTYLPLTSMQPTEAMG 

EEPSRAE 

Signal sequence 
1-29 

Transmembrane domain 
484-504 

N-glycosylation site. 
52-55 
84-87 
87-90 

101- 104 
110-113 
134-137 
206-209 
264-267 
295-298 
308-311 
320-323 
363-366 
389-392 
453-456 

457- 460 

Glycosaminoglycan attachment site. 

76- 79 

N-myristoylation site. 

77- 82 

102- 107 
132-137 
144-149 
185-190 
202-207 
245-250 
329-334 
348-353 

458- 463 

Intercellular adhesion molecule (ICAM) 
12-119 



Immunoglobulin domain 
132-192 
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FIGURE 11 



GTTTGGGGGTTGTTTGGGATTAGTGAAGCTACTGCCTTTGCCGCCAGCGCAGCCTCAGAG 
TTTGATTATTTGCAATGTCAGGCTTTGAAAACTTAAACACGGATTTCTACCAGACAAGTT 
ACAGCAT CGATGATCAGTCACAGCAGTC CTATGATTATGGAGGAAGTGGAGGACCCTATA 

GCAAACAGTATGCTGGCTATGACTATTCGCAGCAAGGCAGATTTGTCCCTCCAGACATGA 

TGCAGCCACAACAGCCATACACCGGGCAGATTTACCAGCCAACTCAGGCATATACTCCAG 

CTTCACCTCAGCCTTTCTATGGAAACAACTTTGAGGATGAGCCACCTTTATTAGAAGAGT 

TAGGTATCAATTTTGACCACATCTGGCAAAAAACACTAACAGTATTACATCCGTTAAAAG 

TAGCAGATGGCAGCATCATGAATGAAACTGATTTGGCAGGTCCAATGGTTTTTTGCCTTG 

CTTTTGGAGCCACATTGCTACTGGCTGGCAAAATCCAGTTTGGCTATGTATACGGGATCA 

GTGCAATTGGATGTCTAGGAATGTTTTGTTTATTAAACTTAATGAGTATGACAGGTGTTT 

CATTTGGTTGTGTGGCAAGTGTCCTTGGATATTGTCTTCTGCCCATGATCCTACTTTCCA 

GCTTTGCAGTGATATTTTCTTTGCAAGGAATGGTAGGAATCATTCTCACTGCTGGGATTA 

TTGGATGGTGTAGTTTTTCTGCTTCCAAAATATTTATTTCTGCATTAGCCATGGAAGGAC 

AGCAACTTTTAGTAGCATATCCTTGCGCTTTGTTATATGGAGTCTTTGCCCTGATTTCCG 

TCTTTTGAAAATTTATCTGGGATGTGGACATCAGTGGGCCAGATGTACAAAAAGGACCTT 

GAACTCTTAAATTGGACCAGCAAACTGCTGCAGCGCAACTCTCATGCAGATTTACATTTG 

ACTGTTGGAGCAATGAAAGTAAACGTGTATCTCTTGTTCATTTTTATAGAACTTTTGCAT 

ACTATATTGGATTTACCTGCGGTGTGACTAGCTTTAAATGTTTGTGTTTATACAGATAAG 

AAATGCTATTTCTTTCTGGTTCCTGCAGCCATTGAAAAACCTTTTTCCTTGCAAATTATA 

ATGTTTTTGATAGATTTTTATCAACTGTGGGAAACCAAACACAAAGCTGATAACCTTTCT 

TAAAAACGACCCAGTCACAGTAAAGAAGACACAAGACGGCCGGGCGTGGTAGCTCACGCC 

TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACAAGGGCAGGAGATCGAGAC 

CATCCTGGTTAACACGGTGAAACCCCGACTCTACTAAAACTACAAAAAAAATTAGCTGGG 

CGTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAAGTGTGAA 

CCCAGGAGGCGGAGCTTGCAGTGAGCCGAGATCACACCACTGCACTCCATCCAGCCTGGG 

TGACAGGGTGAGACTCTGTCTCAAAAAAAAAAAAAAAAGGAGACACAAGACTTACTGCAA 

AAATATTTTTCCAAGGATTTAGGAAAGAAAAATTGCCTTGTATTCTCAAGTCAGGTAACT 

CAAAGCAAAAAAGTGATCCAAATGTAGAGTATGAGTTTGCACTCCAAAAATTTGACATTA 

CTGTAAATTATCTCATGGAATTTTTGCTAAAATTCAGAGATACGGGAAGTTCACAATCTA 

CCTCATTGTAGACATGAAATGCGAACACTTACTTACATATTAATGTTAACTCAACCTTAG 

GGACCTGGAATGGTTGCATTAATGCTATAATCGTTGGATCGCCACATTTCCCAAAAATAA 

TAAAAAAATCACTAACCTTTTTTAAGGAAAATATTTAAAGTTTTACAAAATTCAATATTG 

CAATTATCAATGTAAAGTACATTTGAATGCTTATTAAAACTTTCCCAATTAATTTT 
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FIGURE 12 



MSGFENLNTDFyQTSYSIDDQSQQSYDYGGSGGPYSKQYAGYDYSQQGRFVPPDMMQPQQ 
PYTGQIYQPTQAYTPASPQPFYGNNFEDEPPLLEELGINFDHIWQKTLTVLHPLKVADGS 
IMNETDLAGPMVFCLAFGATLLLAGKIQFGYVYGISAIGCLGMFCLLNLMSMTGVSFGCV 
AS VLGYCLLPMILLSSFAVI FSLQGMVGI ILTAGI IGWCSFSASKIFI SALAMEGQQLLV 
AYPCALLYGVFALI SVF 



signal sequence 
none 



Transmembrane domain 

129-149 

160-180 

177-197 

193-213 

235-255 



N-glycosylation site. 
123-126 

N-myristoylation site. 
32-37 
119-124 
174-179 
178-183 
208-213 

Yipl domain 
96-240 



WO 2004/081199 



13/73 



PCT/US2004/007862 



FIGURE 13A 



AGACAAAGCGGTCGCCGCCCCCGCCCGGCCCCCTGGTCTCTGTCTCCGTCCCTCCTCCTT 

TGCTGCCTCTTTCCCTCCTCCTCTCCCTCCCTCCTCCCCTCCCTCCAGTCTCCGGATCTC 

CCTCGGTCCCTCTCTCCTCCTCTTCCTCTCTCTGGACGCCCGGCTCCTCCGCACCCCCTC 

CCCCGGGGGTCCCGCGGCCTGTGAGTTGACTGAGGGGCTCAGACTTGGGGAGTGGGTGTC 

TCCTCGCCCCTGTCCTTGCTCCCGTCCCTGGCCCGGACCTTGGCTGTCTCCTCTTTGTGC 

CGAGATTGTCAGTCTGTGCGGCTACAGCGGGGTGGAGACGGCCGGCTCTGTCACGGCTTC 

ATGAGAGCGGGGACGGGGCGCAGGACTTGCAGGCGCCGGGGAGAAGAGACATGGAGCCGG 

CCCTTGGCACTCTGGGGTCGCGTGGGGCAGTCGGTGGGGGAGGCAGGCGGTGGTGACAGG 

ACAGGGTGGGGGTGGACGCCAGGGTTCTGGGAACGCGCTGGCAGCCCTGACGCCCGGGTT 

CCGAAAGTCTCGGGGGTGGGTATTTCCCCCGACCCGCCTCGGGGGCGGAGTGCGGGGCAG 

AGGGGTGGGGGCTGGGGAGAGGCGTGGCCCGAGCGGTGCTGGAAGCGGAGCCGGGACCTT 

TGGGGCCCGCGCTGAGACGCGCCCGGCTGCTGCC-5CCGCCCTCCTTTCCCCTCTTCCCTG 

GTTTCCCTTCTCCTCTAGACCTGTTCGGTCTCCGOCCCTCCTTGCCTCCCCAACACCCCC 

TCAGGTCCCGTTGCCTCCTGGTCCTTTCAGGGATTCCTGGTCCTTCCTTCCCACACTAGC 

CTCCCTGGGGTATCGCTGAGGCAGCCTGGCCTGCACCCAGGTTCCCCTCACCCCTGCCAC 

ATTTCTCTCTTCTCCCTCACGCCAACTTTCCTTTTCGCCCTTCTCTCTCTTTCTCACATC 

CTAGAGACGGTCTTTAATACGCATTAACCCTGTGCTGCCACATCTGGCTCCTGCCCTCAT 

TGCCTCCAATCCGGACTCTTCCTCTCACATCACCCCCACCACCCCCAACTTGGGCTCACA 

ACTTCTCTTCACTTTTTCCATTTCCCCAGTTCTCTGCCTTCCGTCTTTCCCTCTGTCCTC 

ATCCTTAGCCCCTCTGCCCTGCTTTGTGTCCCACCTCTCCCCCTCCACTTCCTCTCCTCC 

CACCCTCAGTCTCACCCCCGGGCTGTCTCACTCTCTGGAGCCTCTCCTTCCTGTTCTCTG 

TCCCCAGTGCTCCCTACCCTCACCTCAAGACGACCATGGCCACCATCCCAGACTGGAAGC 

TACAGCTGCTAGCCCGGCGCCGGCAGGAGGAGGCGTCCGTTCGAGGCCGAGAGAAAGCAG 

AACGGGAGCGCCTGTCCCAGATGCCAGCCTGGAAACGAGGGCTCCTGGAGCGCCGCCGGG 

CCAAGCTTGGGCTGTCCCCTGGGGAGCCTAGCCCTGTGCTAGGGACTGTAGAGGCTGGAC 

CTCCAGACCCGGATGAGTCTGCGGTCCTTCTGGAGGCCATCGGGCCAGTGCACCAGAACC 

GATTCATCCGGCAGGAGCGGCAGCAGCAGCAGCAGCAACAACAACGGAGTGAAGAGCTGC 

TAGCAGAGAGAAAGCCTGGGCCTCTGGAGGCCCGGGAGCGGAGACCCAGCCCTGGGGAGA 

TGCGGGATCAGAGCCCCAAGGGAAGAGAGTCAAGAGAAGAGAGACTAAGTCCGAGGGAGA 

CCAGAGAGAGGAGGCTGGGGATAGGGGGAGCCCAAGAGTTGAGCCTGAGGCCTCTGGAGG 

CTCGGGACTGGAGGCAAAGCCCAGGAGAGGTGGGAGACAGGAGCTCCCGACTGTCAGAGG 

CATGGAAATGGAGGCTGAGTCCTGGAGAAACTCCAGAGCGGAGTCTGAGACTAGCAGAGT 

CTCGAGAGCAAAGCCCCAGGAGAAAAGAGGTGGAAAGTAGACTGAGCCCAGGGGAATCTG 

CCTACCAGAAGTTGGGCCTGACAGAGGC CCATAAATGGAGACCTGACTC CAGAGAGTCTC 

AGGAACAGAGTTTGGTACAACTGGAGGCAACAGAGTGGAGGCTGAGGTCAGGAGAAGAAA 

GACAAGACTACTCGGAAGAATGTGGGAGAAAAGAAGAGTGGCCAGTTCCAGGGGTAGCTC 

CAAAAGAGACTGCAGAGCTGTCCGAGACCCTGACAAGGGAGGCCCAAGGCAACAGTTCCG 

CAGGAGTGGAGGCAGCAGAGCAGAGGCCTGTGGAAGATGGCGAGAGGGGCATGAAGCCAA 

CAGAAGGGTGGAAATGGACCCTGAACTCCGGGAAGGCTCGAGAATGGACACCCAGGGACA 

TAGAGGCTCAAACTCAGAAACTAGAACCTCCAGAGTCAGCAGAGAAGCTTCTGGAATCTC 

CCGGTGTGGAGGCTGGAGAAGGGGAGGCTGAGAAGGAGGAGGCGGGGGCTCAGGGCAGGC 

CTCTGAGAGCCCTGCAGAACTGCTGCTCTGTGCCCTCCCCCCTCCCACCAGAGGACGCTG 

GGACTGGAGGCCTGAGACAGCAGGAAGAGGAAGCAGTGGAGCTCCAGCCCCCACCACCAG 

CCCCTCTGTCTCCCCCACCCCCAGCCCCAACTGCCCCCCAACCTCCTGGGGATCCCCTCA 

TGAGCCGCCTGTTCTATGGGGTGAAGGCAGGGCCAGGGGTGGGGGCCCCCCGCCGCAGTG 

GACACACCTTCACCGTCAACCCCCGGCGGTCTGTGCCCCCTGCGACCCCAGCCACCCCAA 

CCTCTCCAGCCACAGTTGATGCTGCAGTCCCGGGGGCTGGGAAGAAGCGGTACCCAACTG 

CCGAGGAGATCTTGGTTCTGGGGGGCTACCTCCGTCTCAGCCGCAGCTGCCTTGCCAAGG 

GGTCCCCCGAAAGACACCACAAACAGCTTAAGATCTCCTTCAGCGAGACAGCCCTGGAGA 

CCACGTACCAATACCCCTCCGAGAGTTCGGTACTGGAGGAGCTGGGCCCGGAGCCTGAGG 

TCCCCAGTGCCCCCAACCCTCCAGCAGCCCAACCCGACGACGAAGAGGATGAGGAAGAGC 

TGCTGCTGCTGCAGCCAGAGCTCCAGGGCGGGCTGCGCACCAAGGCCCTGATTGTGGATG 

AGTCCTGCCGGCGGTGACCATCTTCCAACATAGGGATATACCTCCCTCCTTCTTATAACT 

GAAGATCCTGGAGCCCGGAAGATTCAGGGCAGACAGACCCTGATAATGAGCCTGGCAGGG 

AAGGGCAACCAACATCTTGTAACTTGCTTTCCCCACCCTGTTTCTGGGGGCAGAGCCAAT 

TGCCCAATTTCTACCCTAATCCAAAGTCCCTGGTGTGGGTGGGGTTAAACGTGCTGGTGC 

ATCCTAGGTCATCCAAGAGTGAGCGCCAAGTCCTGAGAAGGGGCACAGAACTCCCTGGAG 
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FIGURE 13B 



GGTGGAGATGGAGCACCTGCCCCCCATGGCAGGGTACACTCTCCCCACAGCCTTCCTCCC 
CACCATCCCGTGGGGACTCTCGGGATTTAAGCACTCGTCTCTCTGGGAGGCCCAGACCCC 
ACTCCATTTATAGGCACATCTCCTTCATTTCCTAGGTCACTGCCCCTTTGTTTACAGCTC 
CTGCCTCCTCCCTTGACCACAGCCTGGTTTACAAATTCCATCAGCTCCCAGCCCCACCTG 
CCAAAGTCCCAGGTTTACAAGCCACGCTTACTTGCTGTGTCTGCGTGGAATTCTCTCCTC 
TGTCCCCTCCAGTCTCCTCATTGGAGTGACCTGAAGGTGTGGCTTCCTCCACTTTTTCTC 
AGTATTACTTTGCCTTAGTTTTCCCCAAGAGGGAAGGCTGGAACTCTTAACTCTGTACCC 
CTTGATAGTTATTTAATTCTGTTTCTCCTAGTGGTTCACAATTGAACTGAATTGAGATGG 
TGTCGGGTGGCTAAGGAGACACCTCACCTCTCCTTCCCCATTGTGCCGCCTTTATCAATT 
GCCTGTTTTGTTTTGTTTGTTTTTTAACTTTCCATAATAAAATGGAGTTCTCTTC 
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"""""""""" ~ :eahk 

;stlt 



R^QGNSSAGVEAAEQRPVEDGERGMKPTEGWKWTLNS^ 



signal sequence 
none 

transmembrane domain 
none 

N-glycosylation site. 
306-309 



CAMP i2l-i25 GMP " dePendent protein kin * se Phosphorylation site 



Tyrosine^kinase phosphorylation site 

N-myristoylation site. 

52-57 

63-68 
155-160 
234-239 
305-310 
310-315 
325-330 
370-375 
598-603 

Amidation site. 
276-279 
501-504 
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FIGURE 15 



GCGGCTCTCTGATCCAGCCCGGGAGAGGACCGAGCTGGAGGAGCTGGGTGTGGGGTGCGT 
TGGGCTGGTGGGGAGGCCTAGTTTGGGTGCAAGTAGGTCTGATTGAGCTTGTGTTGTGCT 
GAAGGGACAGCCCTGGGTCTAGGGGAGAGAGTCCCTGAGTGTGAGACCCGCCTTCCCCGG 
TCCCAGCCCCTCCCAGTTCCCCCAGGGACGGCCACTTCCTGGTCCCCGACGCAACC^TGG 
CTGAAGAACAACCGCAGGTCGAATTGTTCGTGAAGGCTGGCAGTGATGGGGCCAAGATTG 
GGAACTGCCCATTCTCCCAGAGACTGTTCATGGTACTGTGGCTCAAGGGAGTCACCTTCA 
ATGTTACCACCGTTGACACCAAAAGGCGGACCGAGACAGTGCAGAAGCTGTGCCCAGGGG 
GGCAGCTCCCATTCCTGCTGTATGGCACTGAAGTGCACACAGACACCAACAAGATTGAGG 
AATTTCTGGAGGCAGTGCTGTGCCCTCCCAGGTACCCCAAGCTGGCAGCTCTGAACCCTG 
AGTCCAACACAGCTGGGCTGGACATATTTGCCAAATTTTCTGCCTACATCAAGAATTCAA 
ACCCAGCACTCAATGACAATCTGGAGAAGGGACTCCTGAAAGCCCTGAAGGTTTTAGACA 
ATTACTTAACATCCCCCCTCCCAGAAGAAGTGGATGAAACCAGTGCTGAAGATGAAGtlTG 
TCTCTCAGAGGAAGTTTTTGGATGGCAACGAGCTCACCCTGGCTGACTGCAACCTGT1GC 
CAAAGTTACACATAGTACAGGTGGTGTGTAAGAAGTACCGGGGATTCACCATCCCCGAGG 
CCTTCCGGGGAGTGCATCGGTACTTGAGCAATGCCTACGCCCGGGAAGAATTCGCTTCCA 
CCTGTCCAGATGATGAGGAGATCGAGCTCGCCTATGAGCAAGTGGCAAAGGCCCTCAAAT 
AAGCCCCTCCTGGGACTCCCTCAACCCCCTCCATTTTCTCCACAAAGGCCCTGGTGGTTT 
CCACATTGCTACCCAATGGACACACTCCAAAATGGCCAGTGGGCAGGGAATCCTGGAGCA 
CTTGTTCCGGGATGGTGTGGTGGAAGAGGGGATGAGGGAAAGAAATGGGGGGCCTGGGTC 
AGATTTTTATTGTGGGGTGGGGTGAGTAGGACAACATATTTCAGTAATAAAATACAGAAT 
AAAAATCAAGTGTTTTTAAAAAAAA 
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LPKLHIVQWCKKYRGPTIPEAFRGVHRYLSNAYAREEFASTCPDDEEIELAYEQVAKAL 

Signal sequence 
none 



Transmembrane domain 
22-42 

N-glycosylation site. 
42-45 

CAMP- and cGMP-dependent protein kinase phosphorylation site 

4 9-52 



N-myristoylation site. 

15-20 

18-23 

38-43 
132-137 
170-175 



WO 2004/081199 



18/73 



PC17US2004/007862 
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CATCAGCTTTGAAAGCCAACACATCCTCCTGAGAGGGGACAAGACAAGCAGGGATATGTG 

GGCCACTGGATCTTTGCCAGACTTCCCGGCTGCAGCCAAGTTCTTAGGGTTCCGTCAGCG 

CTGCATCrCCAGGAGCCTCTGCCTCAGTGAGTGTCCTCTGGAGCCCCCAAGCCTCACCCG 

CCTCTGTGCCACTCTGAAGGACTGCCCGGGACCCCTGGAACTGCAATTGTCCTGTGAGTT 

CCTGAGTGACCAGAGCCTGGAGACTCTACTGGACTGCTTACCTCAACTCCCTCAGCTGAG 

CCTGCTGCAGCTGAGCCAGACGGGACTGTCCCCGAAAAGCCCCTTCCTGCTGGCCAACAC 

CTTAAGCCTGTGTCCACGGGTTAAAAAGGTGGATCTCAGGTCCCTGCACCATGCAACTTT 

GCACTTCAGATCCAACGAGGAGGAGGAAGGCGTGTGCTGTGGTCTCTCAGCAAACCTGCT 

GGGCGACAGCGGACTCAGATGCCTTCTGGAATGTCTGCCGCAGGTGCCCATCTCCGGTTT 

GCTTGATCTGAGTCACAACAGCATTTCTCAGGAAAGTGCCCTGTACCTGCTGGAGACACT 

GCCCTCCTGCCCACGTGTCCGGGAGGCCTCAGTGAACCTGGGCTCTGAGCAGAGCTTCCG 

GATTCACTTCTCCAGAGAGGACCAGGCTGGGAAGACACTCAGGCTAAGTGAGTGCAGCTT 

CCGGCCAGAGCACGTGTCCAGGCTGGCCACCGGCTTGAGCAAGTCCCTGCAGCTGACGGA 

GCTCACGCTGACCCAGTGCTGCCTGGGCCAGAAGCAGCTGGCCATCCTCCTGAGCTTGGT 

GGGGCGACCCGCAGGGCTGTTCAGTCTCAGGGTGCAGGAGCCGTGGGCGGACAGAGCCAG 

GGTTCTCTCCCTGTTAGAAGTCTGCGCCCAGGCCTCAGGCAGTGTCGCTGAAATCAGCAT 

CTCCGAGACCCAGCAGCAGCTCTGTGTCCAGCTGGAATTTCCTCGCCAGGAAGAGAATCC 

AGAAGCTGTGGCACTCAGGTTGGCTCACTGTGACCTTGGAGCCCACCACAGCCTTCTTGT 

CGGGCAGCTGATGGAGACATGTGCCAGGCTGCAGCAGCTCAGCTTGTCTCAGGTTAACCT 

CTGTGAGGACGATGATGCCAGTTCCCTGCTGCTGCAGAGCCTCCTGCTGTCCCTCTCTGA 

GCTGAAGACATTTCGGCTGACCTCCAGCTGTGTGAGCACCGAGGGCCTCGCCCACCTGGC 

ATCTGGTCTGGGCCACTGCCACCACTTGGAGGAGCTGGACTTGTCTAACAATCAATTTGA 

TGAGGAGGGCACCAAGGCGCTGATGAGGGCCCTTGAGGGGAAATGGATGCTAAAGAGGCT 

GGACCTCAGTCACCTTCTGCTGAACAGCTCCACCTTGGCCTTGCTTACTCACAGACTAAG 

CCAGATGACCTGCCTGCAGAGCCTCAGACTGAACAGGAACAGTATCGGTGATGTCGGTTG 

CTGCCACCTTTCTGAGGCTCTCAGGGCTGCCACCAGCCTAGAGGAGCTGGACTTGAGCCA 

CAACCAGATTGGAGACGCTGGTGTCCAGCACTTAGCTACCATCCTGCCTGGGCTGCCAGA 

GCTCAGGAAGATAGACCTCTCAGGGAATAGCATCAGCTCAGCCGGGGGAGTGCAGTTGGC 

AGAGTCTCTCGTTCTTTGCAGGCGCCTGGAGGAGTTGATGCTTGGCTGCAATGCCCTGGG 

GGATCCCACAGCCCTGGGGCTGGCTCAGGAGCTGCCCCAGCACCTGAGGGTCCTACACCT 

ACCATTCAGCCATCTGGGCCCAGGTGGGGCCCTGAGCCTGGCCCAGGCCCTGGATGGATT 

CCCCCATTTGGAAGAGATCAGCTTGGCGGAAAACAACCTGGCTGGAGGGGTCCTGCGTTT 

CTGTATGGAGCTCCCGCTGCTCAGACAGATAGACCTGGTTTCCTGTAAGATTGACAACCA 

GACTGCCAAGCTCCTCACCTCCAGCTTCACGAGCTGCCCTGCCCTGGAAGTAATCTTGCT 

GTCCTGGAATCTCCTCGGGGATGAGGCAGCTGCCGAGCTGGCCCAGGTGCTGCCGCAGAT 

GGGCCGGCTGAAGAGAGTGGACCCGGAGAAGAATCAGATCACAGCTTTGGGGGCCTGGCT 

CCTGGCTGAAGGACTGGCCCAGGGGTCTAGCATCCAAGTCATCCGCCTCTGGAATAACCC 

CATTCCCTGCGACATGGCCCAGCACCTGAAGAGCCAGGAGCCCAGGCTGGACTTTGCCTT 

CTTTGACAACCAGCCCCAGGCCCCTTGGGGTACTTGATGGCCCCCTCAAGACCTTTGGAA 

TCCAGCCAAGTGATGCACCCAAATGATCCACCTTTCGCCCACTGGGATAAATGACTCAGG 

AAAGAAGAGCCTCGGCAGGGCGCTCTGCACTCCACCCAGGAGGAAGGATACGTGTGTCCT 

GCTGCAGTCCTCAGGGAGAACTTTTTTGGGAACCAGGAGCTGGGTCTGGACAAAGGAGTA 

CCCTGCATTACGTGGGATATGTGTGATCAATTGGGGACATGCGACACACAATGAGGGTGT 

CATGACAATGCATGACACGTACGGTTATATGTGGCAGTGTGACCCCTTGACATGTGGCGT 

TACATGAAAGTCAGTGTGGCACGTGTTCTGTGGCATGGGTGCTGGCATCCCAAGTGGCAG 

GATACATGATTGTTGGTCTATATATGACACATGACAAATGTCCATGTCACAGGACTCATG 

GCTGGCCAGATGACCTCAGGCTGGCCCAAGATCTAATTTATTAATTTTTAAAGCAAATAC 

ATATTTATAGATTGTGTGTATGGAGCAGCTAAGTCAGGAAAAGTCTTCCGCCCGAGCTGG 

GAGGGGAGAGTGTCCATGCACTGACCAGTCCAGGGGCTCAAGGGCCAGGGCTCTGGAACA 

AGCCAGGGACTCAGCCATTAAGTCCCCTCCTGCCTCAATCCTCAGCCTACCCATCTATAA 

ACTTGATGACTCCTCCCTTACTTACATACTAGCTTCCAAGGACAGGTGGAGGTAGGGCCA 

GCCTGGCGGGAGTGGAGAAGCCCAGTCTGTCCTATGTAAGGGACAAAGCCAGGTCTAATG 

GTACTGGGTAGGGGGCACTGCCAAGACAATAAGCTAGGCTACTGGGTCCAGCTACTACTT 

TGGTGGGATTCAGGTGAGTCTCCATGCACTTCACATGTTACCCAGTGTTCTTGTTACTTC 

CAAGGAGAACCAAGAATGGCTCTGTCACACTCGAAGCCAGGTTTGATCAATAAACACAAT 

GGTATTCC 
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FIGURE 18 



MWATGSLPDFPAAAKFLGFRQRCIPRSLCLSECPLEPPSLTRLCATLKDCPGPLELQLSC 
EFLSDQSLETLLDCLPQLPQLSLLQLSQTGLSPKSPFLLANTLSLCPRVKKVDLRSIjHHA 
TLHFRSNEEEEGVCCGLSANLLGDSGIiRCLLECLPQVPISGLLDLSHNSISQESALYLLE 
TLPSCPRVREAS'VNLGSEQSFRIHFSREDQAGKTLRLSECSFRPEHVSRLATGLSKSLQL 
TELTLTQCCLGQKQIiAILIiSLVGRPAGLFSLRVQEPWADRARVLSLLEVCAQASGSVAEI 
SISETQQQLCVQLEFPRQEENPEAVALRLAHCDLGAHHSLLVGQLMETCARLQQLSLSQV 
NLCEDDDASSLLLQSLLLSLSELKTFRLTSSCVSTEGLAHLASGLGHCHHLEELDLSNNQ 
FDEEGTKALMRALEGKWMLKRLDLSHLLLNSSTLALLTHRLSQMTCLQSLRLNRNSIGDV 
GCCHLSEALRAATSLEELDLSHNQIGDAGVQHLATILPGLPELRKIDLSGNSISSAGGVQ 
LAESLVLCRRLEELMLGCNALGDPTALGLAQELPQHLRVLHLPFSHLGPGGALSLAQALD 
GFPHLEEISLAENNLAGGVLRFCMELPLLRQIDLVSCKIDNQTAKLLTSSFTSCPALEVI 
LLSWNijLGDEAAAEIiAQVLPQMGRLKRVDPEKNQI TALGAWLLAEGIjAQGS s iqvi RLWN 
NPIPCDMAQHLKSQEPRLDFAFFDNQPQAPWGT 

Signal sequence 
None 

Transmembrane domain 

654-674 
694-714 

N-glycosylation site. 
450-453 
644-644 

Glycosaminoglycan attachment site. 

403- 406 

N-myristoylation site. 

132-137 
136-141 
196-201 
233-238 
335-340 

404- 409 
530-535 
538-543 
590-595 
706-711 

Leucine zipper pattern. 
84-105 

Leucine Rich Repeat 

80-107 

239-262 

410-434 

438-462 

466-493 

494-521 

522-546 

550-578 

604-627 

628-655 

656-680 

712-732 
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ATAGGGTCAGTGGGCCGCTTGGCGGTGTCGTTGCGGTACCAGGTCCGCGTGAGGGGTTCG 

GGGGTTCTGGGCAGGCACAATGGCGTCTCGAGCAGGCCCGCGAGCGGCCGGCACCGACGG 

CAGCGACTTTCAGCACCGGGAGCGCGTCGCCATGCACTACCAGATGAGTGTGACTCTCAA 

GTATGAAATCAAGAAGCTGATCTACGTACATCTGGTCATATGGCTGCTGCTGGTTGCTAA 

GATGAGCGTGGGACACCTGAGGCTCTTGTCACATGATCAGGTGGCCATGCCCTATCAGTG 

GGAATACCCGTATTTGCTGAGCATTTTGCCCTCTCTCTTGGGCCTTCTCTCCTTTCCCCG 

CAACAACATTAGCTACCTGGTGCTCTCCATGATCAGCATGGGACTCTTTTCCATCGCTCC 

ACTCATTTATGGCAGCATGGAGATGTTCCCTGCTGCACAGCAGCTCTACCGCCATGGCAA 

rrrOTACCGTTTCCTCTTTGGTTTTTCTGCCGTTTCCATCATGTACCTGGTGTTGGTGTT 

GGCAGTGCAAGTGCATGCCTGGCAGTTGTACTACAGCAAGAAGCTCCTAGACTCTTGGTT 

(^CC^GCACACAGGAGAAGAAGCATAAMGAAGCCTCTTTGGGGTGAAGCCTGGACATCC 

rATCGAATGAAAGGACACTAGTACAGCGGTTCCAAAATCCCTTCTGGTGATTTTAGCAGC 

TGTGATGTTGGTACCTGGTGCAGACCAGGCCAAAGTTCTGGAAAGCTCCTTTTGCCATCT 

GCTGAGGTGGCAAAACTATAATTTATTCCTGGTTGGCTAGAACTGGGTGACCAACAGCTA 

TGAAACAAATTTCAGCTGTTTGAAGTTGAACTTTGAGGTTTTTCTTTAAGAATGAGCTTC 

GTCCTTGCCTCTACTCGGTCATTCTCCCCATTTCCATCCATTACCCCTTAGCCATTGAGA 

CTAAAGGAAATAGGGAATAAATCAAATTACTTCATCTCTAGGTCACGGGTCAGGAAACAT 

TTGGGCAGCTGCTCCCTTGGCAGGCTGTGGTCTCCTCTGCAAAGCATTTTAATTAAAAAC 

CTCAATAAAGA 
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FIGURE 20 



MASRAGPRAAGTDGSDFQHRERVAMHYQMSVTLKYEIKKLIYVHLVIWLLLVAKMSVGHL 
RLLSHDQVAMPYQWEYPYLLSILPSLLGLLSFPRNNISYLVLSMISMGLFSIAPLIYGSM 
EMFPAAQQL YRHGKAYRFLFGFSAVS I MYLVLVLAVQVHAWQLYYS KKLLDSWFTSTQEK 
KHK 



signal sequence 
none 

Transmembrane domain 

38-58 

77-97 

99-119 
138-158 

N-glycosylation site. 
96-99 



N-myristoylation site. 
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FIGURE 21 



GACGTGAACGGTCGTTGCAGAGATTGCGGGCGGCTGAGACGCCGCCTGCCTGGCACCTAG 

GAGCGCAGCGGAGCCCCGACACCGCCGCCGCCGCCATSGAGTCCGAGACCGAACCCGAGC 

CCGTCACGCTCCTGGTGAAGAGCCCCAACCAGCGCCACCGCGACTTGGAGCTGAGTGGCG 

ACCGCGGCTGGAGTGTGGGCCACCTCAAGGCCCACCTGAGCCGCGTCTACCCCGAGCGTC 

CGCGTCCAGAGGACCAGAGGTTAATTTATTCTGGGAAGCTGTTGTTGGATCACCAATGTC 

TCAGGGACTTGCTTCCAAAGCAGGAAAAACGGCATGTTTTGCATCTGGTGTGCAATGTGA 

AGAGTCCTTCAAAAATGCCAGAAATCAACGCCAAGGTGGCTGAATCCACAGAGGAGCCTG 

CTGGTTCTAATCGGGGACAGTATCCTGAGGATTCCTCAAGTGATGGTTTAAGGCAAAGGG 

AAGTTCTTCGGAACCTTTCTTCCCCTGGATGGGAAAACATCTCAAGGCCTGAAGCTGCCC 

AGCAGGCATTCCAAGGCCTGGGTCCTGGTTTCTCCGGTTACACACCCTATGGGTGGCTTC 

AGCTTTCCTGGTTCCAGCAGATATATGCACGACAGTACTACATGCAATATTTAGCAGCCA 

CTGCTGCATCAGGGGCTTTTGTTCCACCACCAAGTGCACAAGAGATACCTGTGGTCTCTG 

CACCTGCTCCAGCCCCTATTCACAACCAGTTTCCAGCTGAAAACCAGCCTGCCAATCAGA 

ATGCTGCTCCTCAAGTGGTTGTTAATCCTGGAGCCAATCAAAATTTGCGGATGAATGCAC 

AAGGTGGCCCTATTGTGGAAGAAGATGATGAAATAAATCGAGATTGGTTGGATTGGACCT 

ATTCAGCAGCTACATTTTCTGTTTTTCTCAGTATCCTCTACTTCTACTCCTCCCTGAGCA 

GATTCCTCATGGTCATGGGGGCCACCGTTGTTATGTACCTGCATCACGTTGGGTGGTTTC 

CATTTAGACCGAGGCCGGTTCAGAACTTCCCAAATGATGGTCCTCCTCCTGACGTTGTAA 

ATCAGGACCCCAACAATAACTTACAGGAAGGCACTGATCCTGAAACTGAAGACCCCAACC 

ACCTCCCTCCAGACAGGGATGTACTAGATGGCGAGCAGACCAGCCCCTCCTTTATGAGCA 

CAGCATGGCTTGTCTTCAAGACTTTCTTTGCCTCTCTTCTTCCAGAAGGCCCCCCAGCCA 

TCGCAAACTGATGGTGTTTGTGCTGTAGCTGTTGGAGGCTTTGACAGGAATGGACTGGAT 

CACCTGACTCCAGCTAGATTGCCTCTCCTGGACATGGCAATGATGAGTTTTTAAAAAACA 

GTGTGGATGATGATATGCTTTTGTGAGCAAGCAAAAGCAGAAACGTGAAGGCGTGATACA 

AATTGGTGAACAAAAAATGCCCAAGGCTTCTCATGTCTTTATTCTGAAGAGCTTTAATAT 

ATACTCTATGTAGTTTAATAAGCACTGTACGTAGAAGGCCTTAGGTGTTGCATGTCTATG 

CTTGAGGAACTTTTCCAAATGTGTGTGTCTGCATGTGTGTTTGTACATAGAAGTCATAGA 

TGCAGAAGTGGTTCTGCTGGTACGATTTGATTCCTGTTGGAATGTTTAAATTACACTAAG 

TGTACTACTTTATATAATCAATGAAATTGCTAGACATGTTTTAGCAGGACTTTTCTAGGA 

AAGACTTATGTATAATTGCTTTTTAAAATGCAGTGCTTTACTTTAAACTAAGGGGAACTT 

TGCGGAGGTGAAAACCTTTGCTGGGTTTTCTGTTCAATAAAGTTTTACTATGAATGACCC 

TGAAAAAAAAAAAAAAAAAAAAAA 
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MESETEPEPVTLLVKSPNQRHRDLELSGDRGWSVGHLKAHLSRVYPERPRPEDORLTY^P 
Sn^?^ QCLRDLLPKQE ^ H ^ HLVC1 ^ PSraPEINA ^AESTEEPAGSNRGQYPEDS 

signal sequence 



Transmembrane domain 
278-298 

N-glycosylation site 
133-136 
141-144 



Tyrosine kinase phosphorylation site 
262-269 

N-myristoylation site. 
110-115 
154-159 
239-244 



Ubiquitin family 
10-89 
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CGAGMGCGGGTCATGGCGCCCCGAACCCTCATCCTGCTGCTCTCGGGAGCCCTGGCCCT 

GACCGAGACCTGGGCCTGCTCCCACTCCATGAGGTATTTCGACACCGCCGTGTCCCGGCC 

CGGCCGCGGAGAGCCCCGCTTCATCTCAGTGGGCTACGTGGACGACACGCAGTTCGTGCG 

GTTCGACAGCGACGCCGCGAGTCCGAGAGGGGAGCCCCGGGCGCCGTGGGTGGAGCAGGA 

GGGGCCGGAGTATTGGGACCGGGAGACACAGAAGTACAAGCGCCAGGCACAGGCTGACCG 

AGTGAACCTGCGGAAACTGCGCGGCTACTACAACCAGAGCGAGGACGGGTCTCACACCCT 

CCAGTGGATGTATGGCTGCGACCTGGGGCCCGACGGGCGCCTCCTCCGCGGGTATGACCA 

GTCCGCCTACGACGGCAAGGATTACATCGCCCTGAACGAGGACCTGCGCTCCTGGACCGC 

CGCGGACACGGCGGCTCAGATCACCCAGCGCAAGTGGGAGGCGGCCCGTGAGGCGC-AGCA 

GTGGAGAGCCTACCTGGAGGGCACGTGCGTGGAGTGGCTCCGCAGATACCTGGAGAACGG 

GAAGGAGACGCTGCAGCGCGCGGAACACCCAAAGACACACGTGACCCACCATCCCGTCTC 

TGACCATGAGGCCACCCTGAGGTGCTGGG-CCTGGGCTTCTACCCTGCGGAGATCACACT 

GACCTGGCAGCGGGATGGCGAGGACCAAACTCAGGACACCGAGCTTGTGGAGACCAGGCC 

AGCAGGAGATGGAACCTTCCAGAAGTGGGCAGCTGTGGTGGTGCCTTCTGGAGAAGAGCA 

GAGATACACGTGCCATGTGCAGCACGAGGGGCTGCCAGAGCCCCTCACCCTGAGATGGGA 

GCCATCTTCCCAGCCCACCATCCCCATCGTGGGCATCGTTGCTGGCCTGGCTGTCCTGGC 

TGTCCTAGCTGTCCTAGGAGCTGTGATGGCTGTTGTGATGTGTAGGAGGAAGAGCTCAGG 

TGGAAAAGGAGGGAGCTGCTCTCAGGCTGCGTCCAGCAACAGTGCCCAGGGCTCTGATGA 

GTCTCTCATCGCTTGTAAAGCCTGAGACAGCT 
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FIGURE 24 



MRVMAPRTLILLLSGALALTETWACSHSMRYFDTAVSRPGRGEPRFISVGYVDDTQPVRF 
DSDAAS PRGE PRAPWVEQEGPE YWDRETQKYKRQAQADRVNLRKLRGYYNQS EDGSHTLQ 
WMYGCDLGPDGRLLRGYDQSAYDGKDYIALNEDLRSWTAADTAAQITQRKWEAAREAEQW 
RAYLEGTCVEWLRRYLENGKETLQRAEHPKTHVTHHPVSDHEATLRCWALGFYPAEITLT 
WQRDGEDQTQDTELVETRPAGDGTFQKWAAVWPSGEEQRYTCHVQHEGLPEPLTLRWEP 
SSQPTIPIVGIVAGLAVLAVLAVLGAVMAVVMCRRKSSGGKGGSCSQAASSNSAQGSDES 
LIACKA 



Signal sequence 
1-24 

Transmembrane domain 
308-328 



N-glycosylation site. 
110-113 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

334- 337 

335- 338 



Tyrosine kinase phosphorylation site. 
135-142 

N-myristoylation site. 
310-315 
325-330 
339-344 
342-347 
356-361 



Microbodies C-terminal targeting signal. 
364-367 

Immunoglobulins and major histocompatibility complex proteins signature. 
281-287 

Class I Histocompatibility antigen 
25-203 

Immunoglobulin domain 
220-285 
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FIGURE 25 



ACTCCCAACGAGCGCCCAAGAAGAAAATGGCCATAAGTGGAGTCCCTGTGCTAGGATTTT 

TCATCATAGCTGTGCTGATGAGCGCTCAGGAATCATGGGCTATCAAAGAAGAACATGTGA 

TCATCCAGGCCGAGTTCTATCTGAATCCTGACCAATCAGGCGAGTTTATGTTTGACTTTG 

ATGGTGATGAGATTTTCCATGTGGATATGGCAAAGAAGGAGACGGTCTGGCGGCTTGAAG 

AATTTGGACGATTTGCCAGCTTTGAGGCTCAAGGTGCATTGGCCAACATAGCTGTGGACA 

AAGCCAACCTGGAAATCATGACAAAGCGCTCCAACTATACTCCGATCACCAATGTACCTC 

CAGAGGTAACTGTGCTCACGAACAGCCCTGTGGAACTGAGAGAGCCCAACGTCCTCATCT 

GTTTCATCGACAAGTTCACCCCACCAGTGGTCAATGTCACGTGGCTTCGAAATGGAAAAC 

CTGTCACCACAGGAGTGTCAGAGACAGTCTTCCTGCCCAGGGAAGACCACCTTTTCCGCA 

AGTTCCACTATCTCCCCTTCCTGCCCTCAACTGAGGACGTTTACGACTGCAGGGTGGAGC 

ACTGGGGCTTGGATGAGCCTCTTCTCAAGCACTGGGAGTTTGATGCTCCAAGCCCTCTCC 

CAGAGACTACAGAGAACGTGGTGTGTGCCCTGGGCCTGACTGTGGGTCTGGTGGGCATCA \ 

TTATTGGGACCATCTTCATCATCAAGGGAGTGCGCAAAAGCAATGCAGCAGAACGCAGGG 

GGCCTCTG TAA GGCACATGGAGGTGATGATGTTTCTTAGAGAGAAGATCACTGAAGAAAC 

TTCTGCTTTAATGACTTTACAAAGCTGGCAATATTACAATCCTTGACCTCAGTGAAAGCA 

GTCATCTTCAGCGTTTTCCAGCCCTATAGCCACCCCAAGTGTGGTTATGCCTCCTCGATT 

GCTCCGTACTCTAACATCTAGCTGGCTTTCCCTGTCTATTGCCTTTTCCTGTATCTATTT 

TCCTCTATTTCCTATCATTTTATTATCACCATGCAATGCCTCTGGAATAAAACATACAGG 

AGTCTGTCTCTGCTATGGAATGCCCCATGGGGCTCTCTTGTGTACTTATTGTTTAAGGTT 

TCCTCAAACTGTGATTTTTCTGAACACAATAAACTATTTTGATGATCTTGGGTGGAAAA 
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FIGURE 26 



MAI SGVPVLGFFI IAVLMSAQESWAIKEEHVI IQAEFYLNPDQSGEFMFDFDGDEIFHVD 
MAKKETWRLEEFGRFASFEAQGALANIAVDKAl^EIMTKRSNYTPITNVPPEVTVLTNS 
PVELREPNVLICFIDKFTPPVVNVTWLRNGKPVTTGVSETVFLPREDHLFRKFHYLPFLP 
STEDVYDCRVEHWGLDEPLLKHWEFDAPSPLPETTEIvTVVCALGLTVGLVGIIIGTIFIIK 
GVRKSNAAERRGPL 



Signal sequence 



Transmembrane domain 
218-238 



N-glycosylation site. 
143-146 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 
63-66 

N-myristoylation site. 
83-88 
156-161 
223-228 
230-235 
241-246 

Immunoglobulins and major histocompatibility complex proteins signature. 
186-192 

Class II histocompatibility antigen 
29-109 

Immunoglobulin domain 
125-190 
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FIGURE 27 



ATGGCGCCCCGAAGCCTCCTCCTGCTGCTCTCAGGGGCCCTGGCCCTGACCGATACTTGG 

GCGGGCTCCCACTCCTTGAGGTATTTCAGCACCGCTGTGTCGCGGCCCGGCCGCGGGGAG 

CCCCGCTACATCGCCGTGGAGTACGTAGACGACACGCAATTCCTGCGGTTCGACAGCGAC 

GCCGCGATTCCGAGGATGGAGCCGCGGGAGCCGTGGGTGGAGCAAGAGGGGCCGCAGTAT 

TGGGAGTGGACCACAGGGTACGCCAAGGCCAACGCACAGACTGACCGAGTGGCCCTGAGG 

AACCTGCTCCGCCGCTACAACCAGAGCGAGGCTGGGTCTCACACCCTCCAGGGAATGAAT 

GGCTGCGACATGGGGCCCGACGGACGCCTCCTCCGCGGGTATCACCAGCACGCGTACGAC 

GGCAAGGATTACATCTCCCTGAACGAGGACCTGCGCTCCTGGACCGCGGCGGACACCGTG 

GCTCAGATCACCCAGCGCTTCTATGAGGCAGAGGAATATGCAGAGGAGTTCAGGACCTAC 

CTGGAGGGCGAGTGCCTGGAGTTGCTCCGCAGATACTTGGAGAATGGGAAGGAGACGCTA 

CAGCGCGCAGATCCTCCAAAGGCACACGTTGCCCACCACCCCATCTCTGACCATGAGGCC 

ACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCGGAGATCACGCTGACCTGGCAGCGG 

GATGGGGAGGAACAGACC CAGGACACAGAGCTTGTGGAGACCAGGC CTGCAGGGGATGGA 

ACCTTCCAGAAGTGGGCCGCTGTGGTGGTGCCTTCTGGAGAGGAACAGAGATACACATGC 

CATGTGCAGCACGAGGGGCTGCCCCAGCCCCTCATCCTGAGATGGGAGCAGTCTCCCCAG 

CCCACCATCCCCATCGTGGGCATCGTTGCTGGCCTTGTTGTCCTTGGAGCTGTGGTCACT 

GGAGCTGTGGTCGCTGCTGTGATGTGGAGGAAGAAGAGCTCAGATAGAAACAGAGGGAGC 

TACTCTCAGGCTGCAGTCACTGACAGTGCCCAGGGCTCTGGGGTGTCTCTCACAGCTAAT 

AAAGTGTGAGACAGCTTCCTTGTGTGGGACTGAGAAGCAAGATATCAATGTAGCAGAATT 

GCACTTGTGCCTCACGAACATACATAAATTTTAAAAATAAAGAATAAA 
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FIGURE 28 



MAPRSLLLLLSGALALTDTWAGSHSLRYFSTAVSRPGRGEPRYIAVEYVDDTQFLRFDSD 
AAI PRMEPRE PWVEQEGPQYWEWTTGYAKANAQTDRVALRNLLRRYNQSEAGSHTLQGMN 
GCDMGPDGRLLRGYHQHAYDGKDY I SLNEDLRSWTAADTVAQI TQRFYEAEEYAEEFRTY 
LEGECLELLRRYLENGKETLQRADPPKAHVAHHPI SDHEATLRCWALGFYPAEI TLTWQR 
DGEEQTQDTELVETRPAGDGTFQKWAAVWPSGEEQRYTCHVQHEGLPQPLILRWEQSPQ 
PT I P I VG I VAGLVVLGAVVTGAWAAVMWRKKS SDRNRGS YSQAAVTDSAQGSGVSLTAN 
KV 



signal sequence 
1-17 

* 

Transmembrane domain 
304-324 

N-glycosylation site. 
107-110 

cAMP- and cGMP- dependent protein kinase phosphorylation site. ■ 

330- 333 

331- 334 

Tyrosine kinase phosphorylation site. 
166-173 

N-myristoylation site. 
118-123 
307-312 
316-321 
321-326 
352-357 
354-359 

Microbodies C-terminal targeting signal. 
360-363 

Crystallins beta and gamma 'Greek key' motif signature. 
131-146 

Immunoglobulins and major histocompatibility complex proteins signature. 
2 78-2 84 

Class I Histocompatibility antigen, domains 
22-200 

Immunoglobulin domain 
217-282 
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FIGURE 29 



CAAGG ATG GTGGTCATGGCGCCCCGAACCCTCTTCCTGCTGCTCTCGGGGGCCCTGACCC 
TGACCGAGACCTGGGCGGGCTCCCACTCCATGAGGTATTTCAGCGCCGCCGTGTCCCGGC 
CCGGCCGCGGGGAGCCCCGCTTCATCGCCATGGGCTACGTGGACGACACGCAGTTCGTGC 
GGTTCGACAGCGACTCGGCGTGTCCGAGGATGGAGCCGCGGGCGCCGTGGGTGGAGCAGG 
AGGGGCCGGAGTATTGGGAAGAGGAGACACGGAACACCAAGGCCCACGCACAGACTGACA 
GAATGAACCTGCAGACCCTGCGCGGCTACTACAACCAGAGCGAGGCCAGTTCTCACACCC 
TCCA.GTGGATGATTGGCTGCGACCTGGGGTCCGACGGACGCCTCCTCCGCGGGTATGAAC 
AGTATGCCTACGATGGCAAGGATTACCTCGCCCTGAACGAGGACCTGCGCTCCTGGACCG 
CAGCGGACACTGCGGCTCAGATCTCCAAGCGCAAGTGTGAGGCGGCCAATGTGGCTGAAC 
AAAGGAGAGCCTACCTGGAGGGCACGTGCGTGGAGTGGCTCCACAGATACCTGGAGAACG 
GGAAGGAGATGCTGCAGCGCGCGGACCCCCCCAAGACACACGTGACCCACCACCCTGTCT 
\ TTGACTATGAGGCCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCGGAGATCATAC 
TGACCTGGCAGCGGGATGGGGAGGACCAGACCCAGGACGTGGAGCTCGTGGAGACCAGGC 
CTGCAGGGGATGGAACCTTCCAGAAGTGGGCAGCTGTGGTGGTGCCTTCTGGAGAGGAGC 
AGAGATACACGTGCCATGTGCAGCATGAGGGGCTGCCGGAGCCCCTCATGCTGAGATGGA 
AGCAGTCTTCCCTGCCCACCATCCCCATCATGGGTATCGTTGCTGGCCTGGTTGTCCTTG 
CAGCTGTAGTCACTGGAGCTGCGGTCGCTGCTGTGCTGTGGAGAAAGAAGAGCTCAGATT 
GAAAAGGAGGGAGCTACTCTCAGGCTGCAATGTGAAACAGCTGCCCTGTGTGGGACTGAG 
TGGCAAGTCCCTTTGTGACTTCAAGAACCCTGACTTCTCTTTGTGCAGAGACCAGCCCAC 
CCCTGTGCCCACCATGACCCTCTTCCTCATGCTGAACTGCATTCCTTCCCCAATCACCTT 
TCCTGTTCCAGAAAAGGGGCTGGGATGTCTCCGTCTCTGTCTCAAATTTGTGGTCCACTG 
AGCTATAACTTACTTCTGTATTAAAATTAGAATCTGAGTAT 
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FIGURE 30 



MVVMAPRTLFLLLSGALTLTETWAGSHSMRYFSAAVSRPGRGEPRFIAMGYVDDTQFVRF 
DSDSACPRMEPRAPWVEQEGPEYWEEETRNTKAHAQTDRMNLQTLRGYYWQSEASSHTLQ 
WMIGCDLGSDGRLLRGYEQYAYDGKDYLALNEDLRSWTAADTAAQISKRKCEAANVAEQR 
RAYLEGTCVEWLHRYLENGKEMLQRADPPKTHVTHHPVFDYEATLRCWALGFYPAEIILT 
WQRDGEDQTQDVELVETRPAGDGTFQKWAAVWPSGEEQRYTCHVQHEGLPEPLMLRWKQ 
SSLPT I P IMGI VAGLWLAAVVTGAAVAAVLWRKKS SD 



signal sequence 
1-24 

Transmembrane domain \ 
306-326 

N-glycosylation site. 
110-113 

♦ 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

333- 336 

334- 337 

Tyrosine kinase phosphorylation site. 
135-142 

N-myristoylation site. 
124-129 
310-315 
324-239 

Immunoglobulins and major histocompatibility complex proteins signature. 
281-287 

Class I Histocompatibility antigen 
25-203 

Immunoglobulin domain 
220-285 
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FIGURE 31A 



GTAAGAGTCGAGGCCTGAGTCGCAGTAGAAGCCGAAGTAGGGGGCGCAGCAAAGACCGGG 

ATCCAAATAGGAATGTTGAGCACAGGGAAAGATCGAAGTTTAAGAGTGAAAGGAATGACC 

TGGAGAGTTCCTATGTGCCTGTGTCTGCACCACCTCCAAACTCTTCTGAGCAGTATTCCT 

CTGGGGCACAGTCTATTCCCAGCACTGTTACTGTGATCGCACCTGCTCACCACTCTGAAA 

ACACAACTGAGAGTTGGTCTAATTACTATAACAATCATAGCTCTTCCAATTCTTTTGGTC 

GAAACCTACCACCAAAGAGGCGATGCAGAGATTATGATGAAAGAGGATTTTGTGTACTTG 

GTGACCTTTGTCAGTTTGATCATGGAAATGATCCCCTAGTTGTTGATGAAGTTGCTCTGC 

CAAGTATGATTCCTTTCCCACCCCCTCCTCCTGGGCTTCCTCCTCCACCACCTCCTGGAA 

TGTTAATGCCTCCAATGCCAGGTCCAGGCCCAGGCCCGGGCCCAGGTCCAGGCCCAGGCC 

CGGGCCCAGGTCCAGGTCCTGGCCATAGTATGAGACTTCCTGTTCCCCAAGGACATGGTC 

AGCCTCCACCATCCGTTGTGCTTCCCATACCAAGACCACCTATAACACAATCAAGCTTGA 

TAAACAGCCGTGACCAGCCTGGGACAAG'^GCAGTGCCCAATCTTGCATCAGTGGGAACAA 

GACTACCTCCTCCTTTACCCCAGAACCTCCTTTACACAGTATCAGAACGACAGCCCATGT 

ACTCTCGTGAACATGGTGCTGCTGCATCTGAGCGACTTCAGTTGGGGACACCGCCTCCTC 

TGTTGGCAGCTCGTTTGGTGCCACCTCGAAACCTCATGGGATCCTCCATTGGATACCATA 

CCTCAGTCTCCAGCCCTACCCCTCTGGTTCCAGATACATATGAACCAGATGGTTACAACC 

CAGAAGCTCCTAGTATTACTAGTTCTGGTAGATCTCAGTACAGACAGTTCTTTTCAAGAA 

CTCAGACACAGCGTCCCAATCTGATTGGCCTAACATCTGGAGATATGGATGTAAATCCAA 

GAGCTGCTAACATTGTGATCCAGACTGAACCACCAGTTCCTGTTTCGATTAATAGCAACA 

TAACCAGAGTAGTTCTTGAACCAGATAGTCGAAAAAGAGCTATGAGTGGTTTGGAAGGGC 

CACTCACAAAGAAACCTTGGCTGGGAAAGCAAGGAAATAACAATCAAAATAAACCAGGGT 

TCTTACGAAAGAATCAGTATACAAACACCAAATTAGAAGTCAAGAAAATCCCTCAGGAAT 

TGAACAACATTACCAAGCTCAATGAACACTTCAGCAAATTTGGAACTATTGTTAATATCC 

AGGTTGCTTTTAAGGGTGACCCAGAAGCAGCCCTAATCCAATATCTTACCAATGAGGAGG 

CCAGGAAAGCCATTTCTAGCACAGAAGCAGTTCTAAACAACCGATTCATTCGAGTCTTGT 

GGCATAGGGAAAATAATGAGCAACCGACACTACAGTCCTCAGCACAGCTGCTCCTGCAAC 

AACAGCAAACACTTAGTCACCTCTCACAGCAGCACCATCACCTGCCACAGCATCTACATC 

AGCAGCAGGTGCTAGTGGCCCAGTCTGCTCCTTCAACAGTGCACGGAGGTATCCAGAAGA 

TGATGAGCAAACCACAGACATCAGGTGCATATGTTCTTAACAAAGTTCCTGTTAAACATC 

GTCTTGGACATGCAGGTGGTAACCAGAGTGATGCATCACATTTGTTGAATCAGTCTGGTG 

GTGCTGGAGAAGATTGCCAGATATTTTCAACTCCAGGCCATCCAAAAATGATTTACAGCT 

CCTCAAACTTAAAGACACCTTCAAAGCTCTGTTCAGGGTCTAAATCTCATGATGTTCAAG 

AAGTGCTTAAAAAAAAACAGGAAGCAATGAAGTTACAACAAGATATGAGGAAAAAAAGAC 

AGGAAGTGTTAGAAAAGCAAATAGAATGCCAAAAGATGTTAATATCCAAGTTAGAAAAAA 

ACAAAAACATGAAACCAGAAGAAAGAGCAAATATAATGAAGACTTTGAAAGAGCTTGGAG 

AGAAGATCTCACAATTAAAAGATGAATTAAAAACATCTTCTGCAGTCTCCACACCATCTA 

AAGTGAAGACAAAAACGGAGGCCCAGAAGGAGTTATTAGATACTGAACTGGACCTCCACA 

AGAGGCTGTCCTCAGGAGAAGACACCACAGAATTACGGAAAAAACTCAGTCAGTTACAGG 

TTGAGGCTGCACGGTTAGGTATTTTACCTGTGGGTCGAGGAAAGACCATGTCCTCTCAAG 

GTCGAGGAAGAGGCCGAGGGCGTGGAGGAAGAGGAAGGGGCTCACTAAATCACATGGTGG 

TGGACCATCGTCCCAAAGCACTAACAGTTGGAGGATTCATTGAGGAAGAAAAAGAAGACT 

TGCTTCAGCATTTCTCAACCGCAAACCAAGGGCCAAAATTTAAAGACCGTCGGCTACAGA 

TATCATGGCACAAGCCCAAGGTACCATCTATATCCACTGAGACTGAAGAAGAAGAAGTCA 

AGGAGGAGGAAACAGAAACCTCAGATTTGTTTTTGCCTGATGATGACGATGAAGATGAAG 

ATGAATATGAGTCTCGCTCATGGCGAAGATGAAATCTGATGCTAGCTGTATAATTTTTAG 

GAATATTGTTTAGAAGAACAACTTTTAAAAATTATTTAAAAGAAGTCAATGAGCCAAAAA 

AAATTTTTTTATTTTTCTTTTCAACACAGTAGGTTCAAGAACAGCAAGTTTGCTATTTAA 

ACACATCTCATAACTGTACATGATATTAAAGCACCAAAGGCCTAGTGACTTTTACACAGT 

TGTGAAGATCCACAGCAATGACATGGATAATCTCTAGAGCCTTATTTTCCACACCACCTT 

TTTTTTGTTGCTGTTGGTGTTGGATTATGATGTAAATGACAGGGTGTTCAGAAATTTTAT 

TTTGGATTATAATCTACTGATAAAATTTAATTAAATGTCAAAATCACCTGTAATCTTTTA 

ACTCTCAGTTGTTATGGATGGGTCATCAGACAGTAGGAAATATTTGTAATTAAAATACCC 

TTTTGCTTTCAAGAGTGGTCTTGGAAGAAAAATATGTTTATTTTAATTTTAATTTTTTTT 

TTTTTTTACTTTGGAGGAGCCCAATTTGTATTCAGTCTAAATTTGTTTTTCATGGTTTTT 

ATGTGTGGTAGTACTAGTTCAGTGAAAATCACAACAAACTGTGTGTTCTTAAATGCTACA 

GCAATGGTAGCTAGCTCTATAGTATTTCCATTCTCTTGTCATATGAACTCCTGGACTTTT 
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FIGURE 31B 



CCTGCTACTGATTCTTAGTTTGCTTTGATATTTAGTGAGCGCTGGTTCGGCTGGTCTTTT 

TAAAAATTTTTATAACTCAGAATGTAAAAGGCTTTCACCTATTGCCCCTTTCTCGTCCCC 

CTTTTCTTCCTCTTGCCTTTCTATTAATAGGCCTGACTTAGTCATCATTGATATAAGTGG 

TTCATTTGGGCAACCGCAGGCAGTCTCCAATAAAAGGTTCAAACCAGCACCTGGATAAAT 

AAGTGATGTTTTGATGAGAATGATGGAAGGATTTTGGGGAGAAGAGAATCACCAGCTTTA 

TTTTTTCCCTCTTTTGAAGCTTCAGTTCAGTATAGTTACACAGAAGGGTTTAAAAATGTC 

TGAAATTCTTTTCCTAAATTTAAACCATGATGGTGAGGTGACTGGGGGGTGGGGGGTGGA 

GGTGGGAGGGGGTTGAGGGCTGAGTATGCTAGTAATAAATGGGAGTCATATATGACTTTG 

TTCAGATGACCTTATTTTTTCATAATTTTGTAGCTTGTTGCTATCCATGAACATACGCAT 

CCCTCAGTAGACAGTTTCCTTTACTGTGTGCATTTTTACTGTACACTGTATAGAGTAGAG 

TATCTGTTAAGTAAAATATATATGTAAATTTATGTCTTCGTGTTCTGAATGTTAGATGGA 

AAAGCAGAGGAGCAACACTACACTGTACTGACCCCTGGGAAAATAAAATGATTAATGTAC 

CTAGGCTGTCACTTTTTCAAGGATATACACATCTGTGTTTTTATATATGTGTGGTATGAA 

AATTTGAAACCTCCTGTTGTCTCCAAAAGGGTCTTGGTGACCACTATATCAAGGATTTTG 

TTTGCTGAGGTAAAGACAATTTCTCAATATAGAAACTATTCACTAAGAACTTAAAACCAT 

CAGGATTCTAGATTTACTTAGCTATAAGAAGTTAAATCCCACGTTCTGATAATCTTTATT 

CTTCTACATGTTTTCCTGGAATGATGTGTCAAACCAGGGTCATTTTTAAGTTCATATAAT 

TTTCTTTCATAATAATGAGGACTTCCTTTTTCTAGGTTATTGTTATATTTCTAGTAACAC 

TGAAGGTAAAATAAGCTTATAAAAGAAGGAAAACATCAGAATGGTGTATCTAGAATACAC 

AATCTTGCTCCTTACTCCTCCATCATCTCTAATGAAATTTTTCTTGATGTCTGTGGCTCA 

GTCAGTATATATTTTGAAAGATTCATTGTGGTGAATATTTTGAGTCCTGACAGTTTAAAC 

GTGATAGAGGGAACAAAGGATTTATATATTTTTTGTACAAAGTTTTTTCTTAAACTGTAT 

AATTTTGTAAATAATTTGTTTGGGTTTTTTTTTTGTGTACTAAAACTACCAGCGTATAGA 

TTTCAATAAGAATTGTTGTATTTTTGTATTTGGAAACTGAAAATTACAGTAAATAAATGG 

AACTCTAAGAAAAATTTCAGTAGACAGTTCAACAGAATGAGATTCCTTTTCATATGATTT 

TTTTGACCTCAAAATGGACATCTTATTTCACTCTCAGTAAAAAGTTGAAAATTTGACTTT 

TTAGTATTGGGGGCAGAAGATAGAAGTGTGTTAAAGTGAAAAACAGTTTTTTCAGTAAGG 

TTTGAGGCCCTTTCTATAAGTTTTCAGTACACCTTAAACAGGACTAAAAATACTTTAAAA 

GTGTGCTTTCAAAGAATTTTAAAACATGTTTGACTCAAAGATGAATTTGAAGTGAGTTAA 

TTATAATACTGGAAAACTTCAGGTTAGCATAAAATAGTTTATATATATCCCCTGCCCCCT 

TTGGGATTTACTTCCACCATCAAATCTTCAACTGTTAGAATTTCACTTAAAAGTGAATAA 

CTGCCATACATGTTTTCTTGGCATCTTCATTTACAGAAATAACATACAGCATCTAAAGCA 

AGTTCTGTCCCTCTTAATGTGTATGACATCTTTTTAAGTAGCTGATCAGCAATAATAGCC 

ACCAGTTGACTAGGTCAGCCTGATATACTAGTGGATCACTGTCCAAGATAAAAAGGTAAC 

CCCAAGTATTGCAAAATTTCCCTCAATTTCTTTATGTAACATTGAAGAAATGTTGTTACC 

AGCTAAGATGATAAATTGGAAAACAGTCTTGTTAAGGTGAAAGGGGTCCTTGGTGATGGA 

TCATTATTTCAAAAAAGAAAAAAATCATGTGAAAAGGAATAGTGGTTCCTCTTGATGTAT 

AGTATGCCTGTATTATAGTTTTTACAAAATTGTGAACTTGTGTAATAGTATAATAGGAGA 

TATTGTTGAATTTCTAACTGTTTATACATTTAAATTCATATATGTAATGTTTGTTTTATC 

AATTACAATCTGAATTTCTAAGAAGCATGTTGACTTTTGCAATAAAGATGCAATATGGAA 

TGGTTCACAATTGG 
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FIGURE 32 



MIPFPPPPPGLPPPPPPGMLMPPMPGPGPGPGPGPGPGPGPGPGPGHSMPJhPVPQGHGQP 
PPSWLPIPRPPITQSSLINSRDQPGTSAVPNLASVGTRLPPPLPQNLLYTVSERQPMYS 
REHGAAASERLQLGTPPPLLAARLVPPRNLMGSSIGYHTSVSSPTPLVPDTYEPDGYNPE 
APSITSSGRSQYRQFFSRTQTQRPNLIGLTSGDMDVNPRAA^TIVIQTEPPVPVSINSNIT 
RVVLEPDSRKRAMSGLEGPLTKKPWLGKQGNlSnsrQNKPGFLRKNQYTNTKLEVKKIPQELN 
NITKLNEHFSKFGTIVNIQVAFKGDPEAALIQYLTNEEARKAISSTEAVLNNRFIRVLWH 
RENNEQPTLQSSAQLLLQQQQTLSHLSQQHHHLPQHLHQQQVLVAQSAPSTVHGGIQKMM 
SKPQTSGAYVLNKVPVKHRLGHAGGNQSDASHLLNQSGGAGEDCQIFSTPGHPKMIYSSS 
NLKTPSKLCSGSKSHDVQEVLKKXQEAMKLQQDMRKKRQEVLEKQIECQKMLISKLEKNK 
NMKPEERANIMKTLKELGEKISQLKDELKTSSAVSTPSKVKTKTEAQKELLDTELDLHKR 
LSSGEDTTELRKKLSQLQVEAARLGILPVGRGKTMSSQGRGRGRGRGGRGRGSLNHMVVD 
HRPKALTVGGFIEEEKEDLLQHFSTANQGPKFKDRRLQISWHKPKVPSISTETEiJEEVKE 
EETETSDLFLPDDDDEDEDEYESRSWRR \ 

Signal sequence 
none 

Transmembrane domain 
none 

N-glycosylation site. 
238-241 
301-304 
446-449 
455-458 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 
599-602 
612-615 

N-myristoylation site. 
124-129 
152-157 
208-213 
313-318 
444-449 



WO 2004/081199 



35/73 



PCT/US2004/007862 



FIGURE 33 

AAACACTCTGTGTGGCTCCTCGGCTTTGGGACAGAGTGCAAGACGaTGACTTGCAAAATG 
TCGCAGCTGGAACGCAACATAGAGACCATCATCIAACACCTTCCACCAA 

ctggggcacccagacaccctgaaccagggggaattcaaagagctggtccgaaSgmctg 

CTGGACACAAATGCAGACAAGCAGCTGAGCTTCGAGGAGTTCATCATGCTGATGGCGAGG 

CTAACCTGGGCCTCCCACGAGAAGATGCACGAGGGTGACGAGGGCCCTGGCCAC^ACCAT 

^^ C ^^ TCGGGGAGGG ^CCCCCTMGACCACAGTGGCCAAGATCACAGTGGCCAC 
GG ^^S C ^ CAGTCATGGTGGCCACGGCCACAGGC ^CTAATCAGGAGGCC^^ 

CCTGCCTCTACCCAACCAGGGCCCCGGGGCCTGTTATGTCAAACTGTCTTGGCTGTGGGG 
CTAGGGGCTGGGGCCAAATAAAGTCTCTTCCTCCAA GGCTGTGGGG 



WO 2004/081199 



36/73 



PCT/US2004/007862 



FIGURE 34 

MTCKMSQLERNIETIINTPHQYSVKLGHPDTLNQGEFKELWKDLQNFLKKENKNEKVIE 
HIMEDLDTNADKQLSFEEFIMLMARLTWASHEKMHEGDEGPGHHHKPGLGEGTP 



signal sequence 
none 

Transmembrane domain 
none 

N-myristoylation site. 
108-113 

S-100/lCaBP type calcium binding domain 
8-51 

EF hand 
58-86 
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FIGURE 35 

AAGAGGAAGCGCTGGCAGACAATGCGACCCGACCGCGCTGAGGCTCCAGGACCGCCCGCC 
ATGGCTGCAGGAGGTCCCGGCGCGGGGTCTGCGGCCCCGGTCTCCTCCACATCCTCCCTT 
CCCCTGGCTGCTCTCAACATGCGAGTGCGGCGCCGCCTGTCTCTGTTCTTGAACGTGCGG 
ACACAGGTGGCGGCCGACTGGACCGCGCTGGCGGAGGAGATGGACTTTGAGTACTTGGAG 
ATCCGGCAACTGGAGACACAAGCGGACCCCACTGGCAGGCTGCTGGACGCCTGGCAGGGA 
CGCCCTGGCGCCTCTGTAGGCCGACTGCTCGAGCTGCTTACCAAGCTGGGCCGCGACGAC 
GTGCTGCTGGAGCTGGGACCCAGCATTGAGGAGGATTGCCAAAAGTATATCTTGAAGCAG 
v CAGCAGGAGGAGGCTGAGAAGCCTTTACAGGTGGCCGCTGTAGACAGCAGTGTCCCACGG 
ACAGCAGAGCTGGCGGGCATCACCACACTTGATGACCCCCTGGGGCATATGCCTGAGCGT 
TTCGATGCCTTCATCTGCTATTGCCCCAGCGACATCCAGTTTGTGCAGGAGATGATCCGG 
CAACTGGAACAGACAAACTATCGACTGAAGTTGTGTGTGTCTGACCGCGATGTCCTGCCT 
GGCACCTGTGTCTGGTCTATTGCTAGTGAGCTCATCGAAAAGAGGTGCCGCCGGATGGTG 
GTGGTTGTCTCTGATGATTACCTGCAGAGCAAGGAATGTGACTTCCAGACCAAATTTGCA 
CTCAGCCTCTCTCCAGGTGCCCATCAGAAGCGACTGATCCCCATCAAGTACAAGGCAATG 
AAGAAAGAGTTCCCCAGCATCCTGAGGTTCATCACTGTCTGCGACTACACCAACCCCTGC 
ACCAAATCTTGGTTCTGGACTCGCCTTGCCAAGGCCTTGTCCCTGCCCTGA 
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MAAGGPGAGSAAPVSSTSSLPLAALNMRVRRRLSLFLNVRTQVAADWTALAEEMDFEYLE 
IRQLETQADPTGRLLDAWQGRPGASVGRLLELLTKLGRDDVLLELGPS I EEDCQKYILKQ 
QQEEAEKPLQVAAVDSSVPRTAELAGITTLDDPLGHMPERFDAFICYCPSDIQFVQEMIR 
QLEQTMYRLKLCVSDRDVLPGTCWSIASELIEKRCRRiVr^ArVVSDDYLQSKECDFQTKFA 
LSLSPGAHQKRLIPIKYKAMICKEFPSILRFITVCDYTNPCTKSWFWTRLAKALSLP 



signal sequence 

1-45 ': 

Transmembrane domain 
none 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 
31-34 

Tyrosine kinase phosphorylation site. 
180-187 

N-myristoylation site. 
4-9 
7-12 
83-88 

Growth factor and cytokines receptors family signature 1. 
192-205 

TIR domain 
163-292 

Death domain 
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GAATTCGGCACGAGTGCCCAGCTCCTGCTGTAATTAGCTCCACGTGTACCCCCTTCATTC 
CCTCCCTCCCACCGAGCCATCCCTGACCCAGGAACTTTCCGCAGACTCGCCGCCATCTGG 
GAGTGAAGCAACATGGATGCAGTCAGCCAAGTCCCCATGGAAGTCGTGCTTCCCAAGCAC 
ATCCTGGATATCTGGGTTATTGTCCTCATCATCCTGGCCACCATTGTCATCATGACCTCG 
TTGTTGCTGTGCCCAGCCACTGCAGTAATCATCTATCGCATGCGGACTCATCCGATCCTT 
AGTGGGGCTGTTTGAGAGCCTCCCAAGAGGGCCGGGTGAGGGATGAGGACAGGCATCCTA 
TCCCCAGCCTCTTCCTGTCTTCAGAAAAGCAGCAGGAGGGACTTTGGGGCATGGACCTGA 
GTTCTGGTTTTGATTCTGCCACGAGCCAGCTGTGTGAATTTGGTCAAGGGACCTAACTCT 
CTGAGTTCCAGGTTCCTTATCTTTCAAATGGGGATGGTGATCCCTGCCCTTTCTACCTCA 
TAGGGATGTGAGAACCACCTGACTTAGTGGATGTGAAAGCTGTTTGTGATCAGTAAAGCT 
ACCACAGATATAAGGGTGTTATGCTGAA*.7CCTGAGAAGCTTTCAAGAACCAGAGAACCTG 
ATTGCTGATGATGGCCTTAAAGGTGGTGAGGGAGATACTGGGGGCAGAGCAGACTTTGCC 
AGTGCCCCTCAGGTCAAACCAAGCCAAGAGCACCCTGTCCCCATTCCAAGGGGCCAGCAG 
CACTTTGGCCCAAAGTATTTTCTTTAAGGTGCCATTCCTTCATGTTTTCTCAGTTTGGAG 
GGTGATGGGTAGAGCTTTCCAGAACCTTCTCCATTCCAGAATCTCTGCCCCTGTGTAATC 
TGAAGGAAGGCTGTGCCATCTTTGGGCACTGCCAAGGGAGTTGGGGTGATGGGCTTCTTT 
CTGCACTGGAGTCTCACATCTGTTAGCTTTGACACTCAAGCAATGTTGGAAAATGCAGGG 
TGACTGAGTTCCCTGCCCAGCTTTCGGGATCTCTGGCCCCCATCCCCTTGTGTGTGTCCC 
TCTGCCCAGCTCCTGCTGTAATTAGCTCCACGTGTACCCCCTTCACTCCCTCCCACCAGC 
TCTGCAGCCAGCCTATGGCAATTATATTTTAAGAGGTGTTCCCAGGACTTTTGGGACCTA 
CTAAAACAATGATGGTTATTTTAGATGTGATGATTTATATTTATGTAGAGATATTTCTGG 
ACCACTCAAGCTCTTCGATACCAAAATCAGGAGCATCTTGGGATTTATTAAATTATGTAA 
GAAGATAGCACAGATATCGGGATATTATTGTGTGAAAATGCTGCTTTTACTTTGATGTGA 
TCTCATTGATGTACACAACCAAGTTCCAATAAAGTGCTAGAATGTGAAAAAAAAAAAAAA 
AAAACTGCGAGGGGGGGACCCGTAACCCTAATCGACCTTAATGAGTGTA 
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MDAVSQVPMEWLPKHILDIWVIVLIILATIVIMTSLLLCPATAVIIYRMRTHPILSGAV 

Signal sequence 
1-44 

Transmembrane domain 
None 
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CTGGAGCAGCTGAAACCGGTTTGAGCGTGGCTGCTTCCTGCCGCTCGACGCCGCGGCAGG 
CCGCCTGGGGGAGCGCTGGCGAGGCACGGACGGCGGGCGCCCGGTACCTCTGCCCGCGGT 
CCTCGCTCTCGGGCGGGGCGGCGGCGACGCGGACCTGCGGACTAGCGAACCCGGAGCACG 
ACATCATAAAATAAATCCATCAGAATGACACCTTCTCAGGTTGCCTTTGAAATAAGAGGA 
ACTCTTTTACCAGGAGAAGTTTTTGCGATATGTGGAAGCTGTGATGCTTTGGGAAACTGG 
AATCCTCAAAATGCTGTGGCTCTTCTTCCAGAGAATGACACAGGTGAAAGCATGCTATGG 
AAAGCAACCATTGTACTCAGTAGAGGAGTATCAGTTCAGTATCGCTACTTCAAAGGGTAC 
TTTTTAGAACCAAAGACTATCGGTGGTCCATGTCAAGTGATAGTTCACAAGTGGGAGACT 
CATCTACAACCACGATCAATAACCCCTTTAGAAAGCGAAATTATTATTGACGATGGACAA 
TTTGGAATCCACAATGGTGTTGAAACTCTGGATTCTGGATGGCTGACATGTCAGACTGAA 
ATAAGATTACGTTTGCATTATTCTGAAAAACCTCCTGTGTCAATAACCAAGAAAAAATTA 
AAAAAATCTAGATTTAGGGTGAAGCTGACACTAGAAGGCCTGGAGGAAGATGACGATGAT 
AGGGTATCTCCCACTGTACTCCACAAAATGTCCAATAGCTTGGAGA'l'ATCCTTAATAAGC 
GACAATGAGTTCAAGTGCAGGCATTCACAGCCGGAGTGTGGTTATGGCTTGCAGCCTGAT 
CGTTGGACAGAGTACAGCATACAGACGATGGAACCAGATAACCTGGAACTAATCTTTGAT 
TTTTTCGAAGAAGATCTCAGTGAGCACGTAGTTCAGGGTGATGCCCTTCCTGGACATGTG 
GGTACAGCTTGTCTCTTATCATCCACCATTGCTGAGAGTGGAAAGAGTGCTGGAATTCTT 
ACTCTTCCCATCATGAGCAGAAATTCCCGGAAAACAATAGGCAAAGTGAGAGTTGACTAT 
ATAATTATTAAGCCATTACCAGGATACAGTTGTGACATGAAATCTTCATTTTCCAAGTAT 
TGGAAGCCAAGAATACCATTGGATGTTGGCCATCGAGGTGCAGGAAACTCTACAACAACT 
GCCCAGCTGGCTAAAGTTCAAGAAAATACTATTGCTTCTTTAAGAAATGCTGCTAGTCAT 
GGTGCAGCCTTTGTAGAATTTGACGTACACCTTTCAAAGGACTTTGTGCCCGTGGTATAT 
CATGATCTTACCTGTTGTTTGACTATGAAAAAGAAATTTGATGCTGATCCAGTTGAATTA 
TTTGAAATTCCAGTAAAAGAATTAACATTTGACCAACTCCAGTTGTTAAAGCTCACTCAT 
GTGACTGCACTGAAATCTAAGGATCGGAAAGAATCTGTGGTTCAGGAGGAAAATTCCTTT 
TCAGAAAATCAGCCATTTCCTTCTCTTAAGATGGTTTTAGAGTCTTTGCCAGAAGATGTA 
GGGTTTAACATTGAAATAAAATGGATCTGCCAGCAAAGGGATGGAATGTGGGATGGTAAC 
TTATCAACATATTTTGACATGAATCTGTTTTTGGATATAATTTTAAAAACTGTTTTAGAA 
AATTCTGGGAAGAGGAGAATAGTGTTTTCTTCATTTGATGCAGATATTTGCACAATGGTT 
CGGCAAAAGCAGAACAAATATCCGATACTATTTTTAACTCAAGGAAAATCTGAGATTTAT 
CCTGAACTCATGGACCTCAGATCTCGGACAACCCCCATTGCAATGAGCTTTGCACAGTTT 
GAAAATCTACTGGGGATAAATGTACATACTGAAGACTTGCTCAGAAACCCATCCTATATT 
CAAGAGGCAAAAGCTAAGGGACTAGTCATATTCTGCTGGGGTGATGATACCAATGATCCT 
GAAAACAGAAGGAAATTGAAGGAACTTGGAGTTAATGGTCTAATTTATGATAGGATATAT 
GATTGGATGCCTGAACAACCT^AATATATTCCAAGTGGAGCAATTGGAACGCCTGAAGCAG 
GAATTGCCAGAGCTTAAGAGCTGTTTGTGTCCCACTGTTAGCCGCTTTGTTCCCTCATCT 
TTGTGTGGGGAGTCTGATATCCATGTGGATGCCAACGGCATTGATAACGTGGAGAATGCT 
TAGTTTTTATTGCACAGAGGTCATTTTGGGGGCGTGCACCGCTGTTCTGGGTATTCATTT 
TTCATCACTGAGCATTGTTGATCTATGCCTTTTGGGCTTCTCAGTTCAATGAAGCAATAA 
TGAAGTATTTAACTCTTTCACTACAGTTCTTGCAAGTATGCTATTTAAATTACTTGGCCA 
GGTATAATTGCCAGTCAGTCTCTTTATAGTGAGAAAATTTATTGGTTAGTAATATAAATA 
TTTTAAACTAAATATATAAATCTATAATGTTAAACATATGTTCATTAAAAGCATAGCACT 
TTGAAATTAACTATATAAATAGCTCATATTTACACTTACAGCTTTTCATTTGATCAGGTC 
TGAAATCTTTAGCACTTAAGGAAAATGACTATGCATAATTATACCTGACCATGAAAAAAA 
TAAGTACCTCAAATGCATGCATTTGCACTGGTGATTCCAACTGCACAAATCTTTGTGCCA 
TCTTGTATATAGGTATTTTTTACATGGGTTGACATGCACACAACACCATTTTCATTCAGT 
ATGAACCTTGAGGCTGCTGCCATTTTTCCACTTAACCAAACCAGCCTGAAGGTGAACCTC 
GAAACTTGTTTCATAAATCTTTCAAAAGTTGTTTTACATCAATGTTAAAATTTCAAAATG 
CTGCAGGGTAATTTAATGTATAAAATATTAGTAAGAAAAAGTATGTATTGCATACTTAGT 
AGAATAGATCACAACATACAAATTCAATTCAGTGCATGCTTTAGGTGTTAAGCATGAGAT 
TGTACATGTTTACTGTTAGGTCCTTGCATCTGTGGTGCTAGGTGAGTATGAGAAGATGTC 
AAGGACTGGACGTATTTTGTTGCCTAAAAAAAAAAGGCTGTTTGTAGGCGTTTTAAATAT 
GCTTATTTTGTGTGTCTCTCACTACCTATTACACACTGTTGCTTTGTGGGTTTGTTTTGT 
ATGTGCGTGTGTTATACAGTAGTTAAATTTCCATGCAGAAAAATAAATGTCCTGAATTCT 
CATATTAGTATTCTTTATTGTATATCATGCATGTAATTTATTTAGAAATGTAGGTCTTAC 
TAAATGTATATGCATGTATTTCAGATTATACTAGGATTTCTTGGATTAGAAGCTGATTGT 
GTTAACTGTAACTTAAAGAATGAATGTTAAATAAAATGATACAGATTTATTTTCTTCATT 
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ACAAAATGAAATTTCAAGAAGGTGTTACTTTTGTAGAATGGTTTTATAATATGACAAGAA 

ATTTTAATATAGTGTCTACCCTAAAGGGATGGCTTATTTGCATCTACCTTTTACTGCATG 

TTTTTCACAAGGCAGTTTATTCATATATTGACATATTTTGGTAGTAGCTGAGAACCTAAG 

ACTTGAAATTATACATTGTGTAGTATTTTTTAAGCTAAGCAATGCAATTTTGGTCAGATC 

TTATTTGTGTGAAGATAGGCTCTGAAATCCTATGGTATTGCGTTTGTAACGTTGATATTA 

ATGCAAAATAGTTTAGGAAATGGAGTCTTCTGCAAGGGTTCTGTATACTTTTCCCACATT 

GTATGAGATTTTCCAAAATTTTGGTGTGAATTGGGCACTTTTGGAAAACTCCTGAAAAAG 

AATTAGTTTCCTTCATCTGCAGACCTTTGTCCAATACGGTTACCATTTCTTTATAGTAAC 

TCGATTAGCCATATATGTTTGTTTCTAGTCCTGCTCCTTTGCTCCTCTCCTATGCCTTCC 

CAGTGCTGGCTCCATTTTGAAGACTCAAGGACAGAGGGGAAGCAGATCATAAAGAGAAAA 

AGGAGACAGAAGAAAGGATGAAGGAAGGAGGTCATGGGGAGTGTGGCTTCTGAGCAGTTT 

AGTTGCTGGGGAGAGCAGACAGTCACTGCCTACAATACAGACAGAACCTTCCTGCTCACT 

TTCTGTCCTATCTCTTCCTGACCTTATGAACCAGTGTTAGTAGATGATTAAAACATGACA 

AGCAATGGCTCCTTATTTTCACAGGACTAAGTCCGGGCCTTCGTATCACTAGCTGTTGCC 

TTTTACACCCTGCTTCAGCCACCCTGTCCCTGTCATTGGCCCTGGACTTCCTCTCTGTGC 

CCGTGTGTCCTCTGCCTGGGAGCCCTCTCCTCCCATAGTCACTTTCTCTCTGCCAAACTC 

ATTTCTTCTTGTGCCCAAGACCTCTCTCCTGAGCCCCTGTGGAAACTTCAGGAAGGATGA 

ATCCGTCTTTGTGCTCCACGGCTCGTACCTTGATCAGGCTGTGCATCACAGTAATTCCGT 

TCTAGGTAGGCAGAGTTGATCTTTGTCTCATCTGCCAGGCTGCGGGCTCTTCAAGGGCAG 

GGACCTTGTCATAGTCATTTTTATTTTCACAGTGCTTGGAACATGGTGGAAAATGAATGT 

TGGAATTATTGGAGTAATATAATTTGTATCAAATGTCCTTTTGAATTAAGAGATTTAGTT 

ATGTTTACTAAGAATGTAAACTTTGAATTGGTTTGCATTTTAACAATTAGGATGGTTTAT 

TGATGTGAATTTTGAAATGTAGAGGTATAATGTTAAATTATTTTATACTTTATGGAAATC 

AAGTGAAATGTTTGAAAAAATGCCGCCATTATCCTCTGGTATTTTCTACTCTCTGGAATT 

ATGTGCTGTAAATGATCGGCTGTAAATGTGAGGCACACCACCCACCCCTGTGTGGAAAGT 

GTTGTGGCGCTTCCTGCCACCCACCCACCTCTCTGCCGTTGCTCCTTGTGACACTTGTCT 

GTCGTCTCCCATCCAAACTCCAAGCTTACAGCTACCTCAGTACTGCTTTGCTTGTCTGAA 

ACACCTCCTTTGCCTTCCTTCAGTGTCCCGCTCAGGTGCAGCCTCCTCCCTAAAGCTCAT 

CTCAGCTTTTGATCTGAATGATGATGGAAACATGCAGACAGCCTCTCAGTCTTACTATTT 

AATGTTGTAGCTGGGAAAAAACCCAGAGAGGTTAACTGATATACTGGGTTGGGACTAGGA 

TGTGGGTTTTGTGACTCTGAA.TCCCATGTTCTCAAACTACGCTGCCTTCCGAAGTCTGGC 

ATTTGTTAGCTCATGCTTCCTTGTAGTCCAGCTTCTTATGTGCCTGTTATATTCTCCAGT 

AAGATTGTAAGCCCCTTAAGGGCAGGGACGTCTTTGCATCTCTAGCACTGCTATAGTGTT 

CTATCCTTAGTTATGAACTAGATAAATAAATGGTGGTGGCAAC 



WO 2004/081199 



43/73 



PCT7US2004/007862 



FIGURE 40 



NKSIRMTPSQVAFEIRGTLLPGEVPAICGSCDALGNWNPQNAVALLPENDTGESMLWKAT 
I VLSRGVSVQYRYPKGYFLEPKTIGGPCQVI VHKWETHLQPRSITPLESEI I IDDGQFGI 
HNGVETLDSGWLTCQTEIRLRLHYSEKPPVSITKKKLKKSRFRVKLTLEGLEEDDDDRVS 
PTVLHKMSNSLEISLISDNEFKCRHSQPECGYGLQPDRWTEYSIQTMEPDNLELIFDFFE 
EDLSEHWQGDALPGHVGTACLLSSTIAESGKSAGILTLPIMSRNSRKTIGKVRVDYIII 
KPLPGYSCDMKSSFSKYWKPRIPLDVGHRGAGNSTTTAQLAKVQENTIASLRNAASHGAA 
FVEFDVHLSKDFVPWYHDLTCCLTMKKKFDADPVELFEIPVKELTFDQLQLLKLTHVTA 
LKSKDRKESWQEENSFSENQPFPSLKMVLESLPEDVGFNIEIKWICQQRDGMWDGNLST 
YFDMNLFLDIILKTVLENSGKRRIVFSSFDADICTMVRQKQNKYPILFLTQGKSEIYPEL 
MDLRSRTTPIAMSFAQFENLLGINVHTEDLLRMPSYIQEAKAKGLVIFCWGDDTNDPENR 
RKLKELGVNGLIYDRIYDWMPEQPNIFQVEQLERLKQELPELKSCLCPTVSRFVPSSLCG 

ESDI H VDANG I DNVENA 

\ 

Signal sequence 
none 

Transmembrane domain 
none 

N-glycosylation site. 
1-4 

49-52 
333-336 
477-480 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 
426-429 

N-myristoylation site. 
29-34 
119-124 
330-335 
332-337 
472-477 
476-481 

Amidation site. 
499-502 

Glycerophosphoryl diester phosphodiesterase 
328-620 

Starch binding domain 
8-115 
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GTACTCTCGCGGTATTTGTCCCGACTCTCGCGGGGTTTAGCGTGGCATTGGGAGGCCCGG 

CCTGGGGGAGGAGACGGCGTTCCGTTAGCGGCGTTGGGGTTTGGCTGCAGTGGCAGTGCT 

TTCTCTTCTGCTCACGGGGACCCGCTCAGGCTGGAGGCCAGCCAGCTCTTGCCGCCACCT 

CGGTCGCGATGGGGGCGCAGGACCGGCCGCAGTGCCACTTCGACATCGAGATCAACCGGG 

AGCCGGTTGGTCGCATTATGTTTCAGCTCTTCTCAGACATATGTCCAAAAACATGCAAAA 

ACTTCCTTTGCTTGTGCTCAGGAGAGAAAGGCCTTGGGAAAACAACTGGGAAGAAGTTAT 

GTTATAAAGGTTCTACGTTCCATCGTGTGGTTAAAAACTTTATGATTCAGGGTGGGGACT 

TCAGTGAAGGTAATGGAAAAGGTGGAGAATCAATTTATGGTGGATATTTTAAAGATGAAA 

ACTTTATTCTCAAACATGACAGAGCGTTCCTTTTATCAATGGCAAATCGAGGGAAACATA 

CCAATGGTTCCCAGTTTTTCATTACCACAAAGCCTGCTCCACACCTGGATGGGGTGCATG 

TAGTCTTTGGACTGGTTATTTCTGGTTTTGAAGTAATCGAACAAATTGAAAATCTGAAGA 

CCGATGCTGCAAGCAGACCATATGCAGATGTGCGAGTTATTGACTGTGGAGTACTTGCCA 

CAAAATCAATAAAAGATGTTTTTGAGAAAAAAAGGAAGAAACCAACTCAlfCAGAAGGCT 

CGGATTCCTCTTCCAATTCCTCCTCTTCTTCAGAATCATCTTCAGAAAGTGAACTTGAAC 

ATGAGAGAAGCAGAAGGAGGAAACATAAGAGGAGGCCAAAAGTTAAACGTTCTAAAAAGA 

GGCGAAAGGAAGCAAGCAGTTCAGAAGAGCCAAGGAATAAACATGCAATGAACCCAAAAG 

GTCACTCTGAGAGGAGTGATACCAATGAAAAAAGGTCAGTTGATTCCAGTGCTAAAAGGG 

AAAAACCTGTGGTCCGCCCAGAAGAGATTCCTCCAGTGCCTGAGAACCGATTTTTACTGA 

GAAGAGATATGCCTGTTGTTACTGCAGAACCTGAACCGAAGATTCCTGATGTTGCACCCA 

TTGTAAGTGATCAGAAACCATCTGTATCAAAGTCTGGACGGAAGATTAAAGGAAGGGGCA 

CAATTCGCTATCACACACCTCCAAGATCAAGATCCTGTTCTGAGTCAGATGATGATGACA 

GCAGTGAAACTCCTCCTCACTGGAAAGAGGAAATGCAGAGATTAAGAGCATATAGACCAC 

CTAGTGGAGAAAAATGGAGTAAAGGAGATAAGTTAAGTGACCCCTGTTCAAGCCGATGGG 

ATGAAAGAAGCTTGTCTCAGAGATCCAGATCATGGTCCTATAATGGATATTATTCAGACC 

TTAGTACAGCAAGACACTCTGGCCACCATAAAAAACGCAGAAAAGAAAAAAAGGTTAAGC 

ATAAAAAGAAAGGGAAAAAGCAGAAACACTGCAGAAGACACAAACAAACAAAGAAGAGAA 

GGATTCTTATACCGTCTGACATAGAATCCTCAAAATCTTCCACTCGAAGAATGAAATCCT 

CTTGTGATAGAGAAAGGAGTTCTCGTTCTTCCTCATTGTCATCTCATCACTCATCAAAGA 

GAGACTGGTCTAAATCTGATAAGGATGTC CAGAGCTCTTTAACCCATTCCAG CAGAGACT 
CATACAGATCAAAATCTCACTCACAGTCTTATTCTAGAGGAAGCTCAAGATCAAGGACTG 
CGTCAAAGTCCTCATCACATTCTCGAAGTAGATCAAAGTCCAGATCTAGTTCCAAGTCTG 
GGCACCGAAAGAGAGCATCAAAATCACCAAGAAAAACAGCTTCTCAGTTAAGTGAAAATA 
AACCAGTTAAAACAGAACCTTTAAGAGCAACCATGGCACAAAATGAAAATGTAGTAGTAC 
AACCAGTTGTAGCAGAAAATATTCCTGTAATACCACTGAGTGACAGTCCCCCCCCTTCAA 
GATGGAAGCCTGGACAGAAACCTTGGAAGCCCTCTTATGAGCGAATTCAGGAAATGAAAG 
CTAAAACAACCCATTTGCTACCCATCCAAAGCACTTACAGTTTAGCAAATATTAAAGAGA 
CTGGTAGCTCATCATCCTACCATAAAAGAGAAAAAAATTCGGAAAGTGATCAGAGCACTT 
ATTCAAAATACAGTGATAGAAGTTCAGAAAGCTCACCAAGGTCAAGGAGCAGATCTTCTA 
GGAGTAGATCTTATTCCAGATCATATACAAGATCACGTAGTCTAGCTAGTTCACATTCAA 
GGTCTAGGTCTCCATCATCTAGATCTCATTCACGAAATAAATACAGTGATCATTCACAGT 
GTAGTAGATCATCTTCATATACTTCTATTAGCAGTGATGATGGAAGGCGAGCTAAGAGGA 
GACTTAGATCCAGTGGGAAAAAAAATAGCGTTTCACATAAAAAGCATAGCAGCAGCTCTG 
AAAAGACACTTCACAGTAAATATGTCAAAGGTAGAGACAGGTCTTCATGTGTGAGAAAGT 
ATAGCGAGAGCAGATCATCTTTAGATTATTCTTCAGACAGTGAGCAGTCAAGTGTTCAGG 
CCACACAGTCAGCCCAGGAAAAAGAGAAGCAGGGCCAAATGGAAAGAACACATAATAAAC 

AAGAAAAAAACAGAGGTGAAGAAAAATC CAAGTCTGAACGGGAATGC C CTCATTCAAAAA 
AAAGAACTTTGAAAGAGAATCTTTCTGATCACCTTAGAAATGGCAGTAAGCCCAAAAGGA 
AGAATTATGCTGGTAGTAAATGGGACTCTGAGTCAAATTCAGAACGAGATGTCACTAAAA 
ACAGTAAAAATGACTCCCATCCATCCTCTGACAAGGAAGAAGGTGAGGCCACATCCGATT 
CTGAATCAGAGGTTAGTGAAATTCACATCAAAGTCAAACCCACAACCAAGTCGTCCACAA 
ATACTTCACTGCCTGATGATAATGGTGCTTGGAAATCAAGCAAACAGCGCACATCAACTT 
CTGACTCTGAGGGGTCCTGTTCCAATTCGGAAAACAATAGGGGAAAGCCACAAAAGCACA 
AACATGGGTCAAAGGAAAATCTTAAAAGAGAACACACCAAAAAAGTGAAAGAGAAATTGA 
AAGGGAAAAAAGACAAAAAGCATAAGGCTCCAAAACGAAAGCAAGCATTTCACTGGCAGC 
CTCCACTAGAATTTGGTGAAGAGGAGGAGGAGGAGATTGATGACAAGCAAGTTACTCAGG 
AATGAAAAGAGAAAAAAGTTTCTGAAAACAATGAAACGATAAAAGATAATATTCTAAAAA 
CTGAGAAATCCAGTGAAGAGGACCTTTCAGGTAAACATGATACAGTGACTGTTTCATCAG 
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ATCTTGATCAGTTTACTAAAGATGATAGTAAACTCAGTATTTCTCCCACAGCTTTAAATA 

CTGAGGAAAATGTGGCCTGTTTACAAAACATTCAGCACGTTGAAGAAAGTGTTCCCAATG 

GAGTGGAAGATGTGCTTCAAACAGATGACAACATGGAGATCTGCACTCCTGATAGGAGTT 

CCCCAGCAAAAGTAGAGGAGACTTCCCCTCTAGGAAATGCACGGCTTGATACCCCAGATA 

TAAACATTGTTTTGAAGCAGGATATGGCAACGGAACATCCTCAAGCAGAGGTAGTAAAAC 

AGGAAAGCAGCATGTCCGAAAGTAAAGTGTTGGGTGAAGTGGGGAAACAGGACAGCAGCT 

CTGCTAGCTTGGCTAGTGCTGGAGAAAGTACCGGGAAGAAGGAGGTGGCTGAGAAGAGCC 

AGATCAACCTCATTGATAAGAAATGGAAGCCCCTGCAAGGTGTGGGGAACCTGGCAGCAC 

CTAATGCTGCCACATCCAGTGCTGTGGAAGTTAAGGTGTTGACCACTGTGCCTGAAATGA 

AACCACAAGGCTTGAGAATAGAAATTAAAAGCAAAAATAAAGTTCGGCCTGGGTCTCTCT 

TTGATGAAGTAAGAAAGACAGCACGCTTAAACCGTAGACCAAGAAATCAGGAGAGTTCAA 

GTGATGAGCAGACGCCTAGTCGGGATGATGATAGCCAGTCCAGGAGTCCAAGTAGATCTC 

GAAGTAAATCTGAAACCAAATCAAGACACAGAACAAGGTCTGTCTCCTATAGTCACTCAA 

GAAGTCGATCGAGAAGTTCCACATCATCTTATCGATCAAGAAGCTACTCTAGAAGTCGGA 

GCAGAGGATGGTACAGCAGAGGCCGAACCAGAAGCCGGAGCAGTTCCTACCGGAGTTACA 

AAAGTCACAGGACGTCCAGCAGGAGCAGATCCAGGAGCAGCTCATATGATCCCCACAGTC 

GATCCAGGTCCTACACCTACGATAGCTACTATAGCAGGAGTCGGAGTCGAAGTAGAAGCC 

AGAGAAGTGACAGTTACCACCGAGGCAGAAGTTATAATCGGCGGTCCAGGAGTTGTAGAT 

CTTATGGCTCTGACAGTGAAAGTGACCGAAGTTACTCTCATCACCGGAGCCCCAGTGAGA 

GCAGCAGATACAGTTGAAAACGTCCGGATACAAATTATATCTTATTTGTAAATATCTGGC 

AACTTAGCTTAAGAAATGTAATGACAGTCTGTTGTTCTATTTCAATATCAGAGGTGAATT 

TCAAAAATAGACACTTCTTAATTGTTACTGGTTCATTTACATGTGGGGAGAAGAATTTAA 

AATACAGATATGTCTCCTAAAAATATTTTTATGCCACATTTTACAGTAGCCAACTATGGA 

AATGAATTTCATTTTCTTGAATCAAGAAATCGTGAAATTTATCTATGTATAATTTGCAAT 

ATTATTTTAAGTCTATTTCACTCTATCTTACGTATCCCTTAGAATACAGATTCTTTTTGC 

CTGTTTTTCCAGTTTTAGCATATATGCTGCCAAGCATAGAACTGTGAAGGAGAACTGTTA 

AAGGCGGCCAAATATTTATATACTGATTACATAGAGTCTTGTACATATGTGCTCTAAAAA 

CAAACCACCCAGAATTGATACTGTTGGTAACCAGGAGTATAAGGCAGTGGCTCTGGGGTT 

CTTAATTCATTCCTAACTTCTTTGATACTTCACAGGATTAGGAAAGTGGTCATCATACAT 

CCCACACAGTCTGTATTACTTCAGGCTTGTGGGCAAGGTTAGGAAGAATCAATCAGCCTT 

AACTATAAATACCTGCACTGTCTCTGAGGACTTACTATTTTATGTTCTTTTTAATCAATA 

CCGATCAGAAGTTTAGGTTATAAAAACAATTCTACTTCATGCTTTGGTGCTTGGTAATTT 

TTGGTGCGTCTTTAAGCATTACTCTTATATATCATATATTAAAATACCATAAAAATGAAA 

TTCAGACAAAATCACTGGCACCAAAAATGGTTTATTCTGAGCTGTCTTCACTTTGACTAT 

TTGGGGGGCTTCTCTCAAGTACAGATGTGGGTTGGGGTCCCCTGGAGCAGGCAGGATTGG 

CAGTAAGAGATATTGGCCACTCAAGTCTACTGTGTGTGTGTGCCTCTGGAAGAGTGAAGA 

ATGGACTTCAAAAGTAACATCAAAAATCTAACTGCCACCATCCTGGAGACATTTTGCAGG 

GCTTTCCTTTCAAGTCTTTCAAGTACAGGATATTACCACAACAGCAGCTGAACTGTTGTA 

ACCAGCATGTTTTTCCTATTTCCACTGTGACCTGCAGCTGACTCAAAGCCTTGCGTGACC 

TGACCCAGGTGCAAGAGACAGGGGAAGAGGGATAGAGGGTATAGCATAAATTACATATTT 

TCATGGCTTTGGGTGGTTCCTCCAAAAATAATTGGACCTGTAAAAACTAGTGTGTGTGTG 

TGTGTGTGTGTGTGTGTGTGTGTGGTTTTTTTTTTTAATCTTTACTTTGAATTTGTTCCC 

CAAGTGTACTTAATCACCTTAGTGCCAGTTTAATCCAGTTATGCAGAAGAAATTCATATT 

GGTTGCCTGATGTAGAGCTCAGCACCACCCTACCACAGGCCTTGTCTGGTGTATTTGGGA 

AGTGGAAAAGAGCCCTCAGTTGGAGGGAGCTGACAACCCTTGGTGGAGGGAGGGTGCCCT 

TGAATGTATTAAAACTATCACCCAAAGAAGGTATGAAAACAGGGTAAGGTGGTCAGTTGT 

TTGCCAGGTCAATAGACAGAAAGTACATTAGAAAACAGGACTTAGGCCAAACAAACAATA 

CTGGATACTGAATACAAAACAGTATGATTTATATTAAAGGTTTCCAAAGGTTGCCTGCAA 

AGGAGAATATTACTACTAGTCAGCAGGAAAAAAATGCATTCAGAACCCAAGCAGAAACTG 

CCAAATGTAATTAGGTTAAGAAAAGTTACCCTTGGGCAGTGTATTAGTTTTCTATTGCTG 

TGTGACAAATTACCCCAAATTTAGCAGTTTAAAAAACAATACCCATAGCAGTTCTGTAGC 

TCATGAGTCTGGCACAGTGTGGCTGGATTCTCTGCTCAGGGTCTTAAAGGCTGAAATAAG 

GGTTGGCAGGACAACATTCCTTCATGGAGGCTCTGGGGAAGAATCTGCTTCTAAGTTCAT 

TCAGGTTGTTGGCGGAATTCAGTTCTTTGCTGGCTCTCAGCTGGAGGCCCCTCTCTCACC 

TCAAGGCTGCCTGCATTCCTTCTTATGTGGTCCCCTCCAGCTTCAAACCAGCCTTCCTGC 

TCTTTCTCATGCTTCATATCTCTCTCCCGTCCTCCTGTTTTAGGGGCATATGATTAGCTC 
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AAGCCCACAGATATATTTTAAGGTTGATTGTGCGATAGAACATAATTGCAGGAGTACTGT 
CTCATCTCATCATATTCACGGGTTCTGGAGATTAGCTCATTGAAAGTGGGAGGGGCATTT 
TCAAATTCTGCCTACCACAGGCAATAACTGCCCATCTCAGCTGTAGGTGGAATTTTTACC 
CAGAAAAGATAGGCCCTAGAAGCCTCATTTCTTTTCTCCATGGAAAAGGACAGCCCTCTG 
CTGCAGCGTTCAACTTGTGTGTTTACTGACAGAGTGAACTACAGAAATAGCTTTTCTTCC 
TAAAGGGGATTGTTCTACATTTTGAAGTTATTTTTTAATAAAATTGAATTATGTTGT 
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FIGURE 42A 



MGAQDRPQCHFDIEINREPVGRIMFQLFSDICPKTCICNFLCLCSGEKGLGKTTGKKLCYK 

GSTFHRWKNFMIQGGDFSEGNGKGGESIYGGYFKDENFILKHDRAFLLSMANRGKHTMG 

SQFFITTKPAPHLDGVHWFGLVI SGFEVI EQI ENLKTDAASRPYADVRVIDCGVLATKS 

IKDVFEKKRKKPTHSEGSDSSSNSSSSSESSSESELEHERSRRRKHKRRPKVKRSKKRRK 

EASSSEEPRNICHAMNPKGHSERSDTNEKRSVDSSAKREKPWRPEEIPPVPENRFLLRRD 

MPWTAEPEPKIPDVAPIVSDQKPSVSKSGRKIKGRGTIRYHTPPRSRSCSESDDDDSSE 

TPPHWKEEMQRLRAYRPPSGEKWSKGDKLSDPCSSRWDERSLSQRSRSWSYNGYYSDLST 

ARHSGHHKKRRKEKKVKHKKKGKKQKHCRRHKQTKKRRILIPSDIESSKSSTRRMKSSCD 

RERSSRSSSLSSHHSSKRDWSKSDKDVQSSLTHSSRDSYRSKSHSQSYSRGSSRSRTASK 

SSSHSRSRSKSRSSSKSGHRKRASKSPRKTASQLSENKPVKTEPLRATMAQNENVWQPV 

VAENIPVIPLSDSPPPSRWKPGQKPWKPSYERIQEMKAKTTHLLPIQSTYSIiANIKETGS 

SSSYHKREKNSESDQSTYSKYSDRSSESSPRSRSRSSRSRSYSRSYTRSRSLASSHSRSR 

SPSSRSHSRNKYSDHSQCSRSSSYTSISSDDGRRAKRRLRSSGKKNSVSHKKHSSSSEKT \ 

LHSKYVKGRDRSSCVRKYSESRSSLDYSSDSEQSSVQATQSAQEKEKQGQMERTHNKQEK 

NRGEEKSKSERECPHSKKRTLKENLSDHLRNGSKPKRKNYAGSKWDSESNSERDVTKNSK 

NDSHPSSDKEEGEATSDSESEVSEIHIKVKPTTKSSTNTSLPDDNGAWKSSKQRTSTSDS 

EGSCSNSENNRGKPQKHKHGSKEI^KKEHTKKV^ 

EFGEEEEEEIDDKQVTQESKEKKVSENNETIKDNILKTEKSSEEDLSGKHDTVTVSSDLD 

QFTKDDS KLSIS PTALNTEENVACLQNI QHVEES VPNGVEDVLQTDDNME I CTPDRS S PA 
KVEETSPLGNARLDTPDINIVLKQDMATEHPQAEWKQESSMSESKVLGEVGKQDSSSAS 
LASAGESTGKKEVAEKSQINLIDKKWKPLQGVGNLAAPNAATSSAVEVKVLTTVPEMKPQ 
GLRIEIKSKNKVRPGSLFDEVRKTARLNRRPRNQESSSDEQTPSRDDDSQSRSPSRSRSK 
SETKSRHRTRSVSYSHSRSRSRSSTSSYRSRSYSRSRSRGWYSRGRTRSRSSSYRSYKSH 
RTSSRSRSRSSSYDPHSRSRSYTYDSYYSRSRSRSRSQRSDSYHRGRSYNRRSRSCRSYG 
SDSESDRSYSHHRSPSESSRYS 



signal sequence 
none 

Transmembrane domain 
none 

N-glycosylation site. 
119-122 
203-206 
864-867 
871-874 
901-904 
938-941 
1048-1051 

■ 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 
190-193 
561-564 
764-767 
771-774 
796-799 
857-860 
1042-1045 

Tyrosine kinase phosphorylation site. 
84-90 
1453-1461 
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FIGURE 42B 



N-myristoylation site. 

48-53 

75-80 

81-86 
141-146 
174-179 
197-202 
531-536 
659-664 
946-951 
962-967 
980-985 

Amidation site. 
53-56 
329-332 
441-444 
751-754 
762-765 
998-1001 
1208-1211 



ATP/GTP -binding site motif A (P-loop) . 
45-52 

Cyclophilin-type peptidyl -prolyl cis- trans isomerase signature. 
59-76 

Cyclophilin type peptidyl -prolyl cis-trans isomerase 
8-177 
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FIGURE 43 



aiGAAAGATCCAAGTCGCAGCAGTACTAGCCCAAGCATCATCAATGAAGATGTGATTATT 

AACGGTCATTCTCATGAAGATGACAATCCATTTGCAGAGTACATGTGGATGGAAAATGAA 

GAAGAATTCAACAGACAAATAGAAGAGGAGTTATGGGAAGAAGAATTTATTGAACGCTGT 

TTCCAAGAAATGCTGGAAGAGGAAGAAGAGCATGAATGGTTTATTCCAGCTCGAGATCTC 

CCACAAACTATGGACCAAATCCAAGACCAGTTTAATGACCTTGTTATCAGTGATGGCTCT 

TCTCTGGAAGATCTTGTGGTCAAGAGCAATCTGAATCCAAATGCAACGGAGTTTGTTCCT 
GGGGTGAAGTACGGAAATATTTGA 
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FIGURE 44 



MKDPSRSSTSPSIINEDVIINGHSHEDDNPFAEYMWMENEEEPNRQIEEELWEEEFIERC 
FQEMLEEEEEHEWFI PARDLPQTMDQI QDQFNDLVI SDGSSLEDLVVKSNLNPNATEFVP 
GVKYGNI 

signal sequence 
none 

transmembrane domain 
none 



N-glycosylation site. 
114-117 



N-myristoylation site. 
121-126 
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FIGURE 45 



i^GTC^GCACCGACCCAAAC^GCAGGAAACAGGAAATGTTCACGTTTCAGAGAGGCTGC 
AGCCCGGCGCAGCATCCTGAGCGCGCCTCTGCCGAGGCGAGCGGACATGCAGGCTCCCCG 
CGCAGCCCTAGTCTTCGCCCTGGTGATCGCGCTCGTTCCCGTCGGCCGGGGTAATTATGA 
GGAATTAGAAAACTCAGGAGATACAACTGTGGAATCTGAAAGACCAAATAAAGTGACTAT 
TCCAAGCACATTTGCTGCAGTGACCATCAAAGAAACATTAAATGCAAATATA2^ATTCTAC 
CAACTTTGCTCCGGATGAAAATCAGTTAGAGTTTATACTGATGGTGTTAATCCCATTGAT 
TTTATTGGTCCTCTTACTTTTATCCGTGGTATTCCTTGCAACATACTATAAAAGAAAAAG 
AACTAAACAAGAACCTTCTAGCCAAGGATCTCAGAGTGCTTTACAGACATATGAACTGGG 
AAGTGAAAACGTGAAAGTCCCTATTTTTGAGGAAGATACACCCTCTGTTATGGAAATTGA 
AATGGAAGAGCTTGATAAATGGATGAACAGCATGAATAGAAATGCCGACTTTGAATGTTT 
ACCTACCTTGAAGGAAGAGAAGGAATCAAATCACAACCCAAGTGACAGTGAATCCTAAAC 
CTGAATGGCC'rCTCATGTTTTCCAAGAGAAGCAGCCCCTGAGGGAGTCTGCTGAGGCTGCC 
AACAGAGGA1GAAGAGGATACAAATTTAATTAATTTCAAATCAACATAGACACAAGAACC 
TTTTGCTGTTTCTTCCAACGCCCACTCTTCCTAATGATGGCATCACTTGCACTTGGAAGA 
ATGTGCAATTGAGAAGTACTAGGAAAAGGCCTGGCTGCCATCCATCGCTGCCTCTGAGGG 
TGGAGAAGGAGGCGGGTGATGTGCTCACTTCTGATCAACATGTGTTGCCTCCTCTCAGCC 
AACTTCTAGCTCACTGCACTCACTCTGGTCATGATAAATGTTCGTCACCTTTCTGCTTCA 
TTCCTTAGGGCCTAAATCAGGAAGCTGTTTTATCGATGGTTTCCTTTTGGGTCAGTAACC 
AGCTTTGGATAATTTCCTCTGATTATTCAAGTCGTGGGACAGGTAAACTACATTCAGCAG 
GAACTTTTCTCGAGGAGTGTTATGTCATGGAAAAGACACCAAACACAGCAAGTATTTTAA 
TGAATACACCATCCCAGGGGGTCAGTAAGCTCTGCCTGCCAAGAAGACACAGTGAGAGGG 
GTCCACAGTC CTGATGAGGTGGCGTTTGGTAACTTGTAGACCCTAGCATGG C CAGGTCTG 
GTCACCCTTAAGAACTTCTCAGAGAAACTAGGAATCTTCAGTGAAAGAACTAATGTTCTC 
CTCAGCTGAAATTCCCTTGCTTGTCAGCATTTCTGCAAAGCTCACACTTGTTTCACCATA 
CCTCCCTTGGATGTGACATGTAGGTAGGAAGTATGTGCAGGTGGGAGTCATCTGTCAGCC 
TTCTATGTTTCAGAGATCCTGAAGGTGGTTTGAAACAAACAGAAGAGGAGCAGGAAATAT 
CCGTGCCTGTGGCAGATCTGACTCATCATGCTTAGCATTCTCTCCCGCCAAGCTGGGATA 
AGCCTCATGTCCTAACACAGCACAACAGGAGGTCTCTGTCAGTCCATCAGAGATGACATT 
CTATGTGATATTTTTGACATCCTTGTGCTAAAAGCAATGGCACAAAATGGAAAAGGGCCT 
ATTGACCACACCTACTCCAGTAAAATTGTTCTTCATTTATTCCTTAATTTTCTAAATCTG 
ACCCCTTTAAAGCAATCTAGCAAATTGAGAATCCTCAGCTCTCCTTGGATACCTGATATT 
TTATTTCAAGAAAGAGACAAAGAAGGAAAATTTTATTTATTTTACTACCCACATATAAAC 
CGAAGGGAGATGGGACTACCCAAACATTTGCTGCTCAATTTTGTGTCTTGTGCTTGAAAG 
TCTGCCCTAATGCATAACAAAAACTACTTGTCTCCTACCTTTTGGGATCCCTTTAACAAG 
TATTTGCCTTCTGAACTACGTGGATAATTTCAAAGGCAGAGTTCCAGACCAGAGAGGTCT 
TTCCATACAATGAAAGCTATAACTAGCTGGTTGTGTTTAACCACTGCTATCACGCTATCC 
TGGGACTGCATAGAACTTTGACAAAAGACAGACTTCATGGTACACAACTTCAACAGATTT 
TCTGTCATATTCTCACCAGCACATCTGAATGAGGCTTTGTGTTTTCCTTGCTCTCTTGCA 
TGTTCCTTTTCAACTCATGGCCCACAGTGACTCTCAGAGATTTATGCCAAAATTGCATAC 
AATTGTTTTCTGAATCATAACTTGTCTATTTTTCTGCCTATGTGTGCTACTTTCAGTTTG 
TTTCTCATCAACATTTTGACTCTCAGAAGAGCCTCCATTTGCCCCTTTCTCTCTTTAGGT 
ATCTAAGATCTTTGAACACCTGGACCTTTACATTTGATCCAACCCTATCAAATAATGAAC 
TTCTCAGAGAGGCATCTGGGGTCCTGGAACTTCATGTTGATGAAGTCATATTATATAATA 
TGATAAAAATATTCTCATGCAGTATTTTAAATAATTTCAAATTCTAGAAGAAGCAAATTT 
CAGCGACATGTCATTGAGTTTTTATTTGGTAAAGCTATAATTGTGCAGTGTACAAAGCAC 
TTTTTAAAAAGATAGTTTATTCTGTCAGGGTATATGAAGTTAGTATACAGCCAGAACAGC 
CAAGCCTCAATTCTTGTACCTTGTGTCTTTTTATTACTGTTTAATCAATAGATATCATAT 
GTTTATGACAGTTTCAAGAATTGTTTTTAAACCCAAACTTAATTTTATGTTTCAGACTAT 
TGTTAGAAAAACAAAACAAAAAACAAAAAAACCCTCACTAATTTGCCCTAATTGGATAGG 
GCAATCAGGACAAAATCTGACTTTCGAATATTTAAAAGATATGTAAGTTTGATTGCATTT 
TCGTACATTTTAAGCAAACTAGGTTAACAACAACATAGCCTAGTCAAACTTCTCAGGAAA 
CTTGTTTTAATAAATATGTAAAAAT 
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FIGURE 46 



MQAPRAALVFALVIALVPVGRGNYEELENSGDTTVESERPNKVTI PSTFAAVTI KETLNA 
NINSTNFAPDENQLEFIIiMVLIPLILLVLLLLSVVFItATYYKRKRTKQEPSSQGSQSALQ 
TYELGSENVKVPIFEEDTPSVMEIEMEELDKWMNSMNRNADFECLPTLKEEKESNHNPSD 

SES 



signal sequence 
1-22 

Transmembrane domain 
76-96 



N-glycosylation site. 
63-66 

CAMP- and cGMP- dependent protein kinase phosphorylation site 

103-106 
N-myristoylation site. 
114-119 
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FIGURE 47 



CTCCGCCCTCTCCCACTCTCTCTTTCCGGTGTGGAGTCTGGAGACGACGTGCAGAAATGG 
CACCTCGAAAGGGGAAGGAAAAGAAGGAAGAACAGGTCATCAGCCTCGGACCTCAGGTGG 
CTGAAGGAGAGAATGTATTTGGTGTCTGCCATATCTTTGCATCCTTCAATGACACTTTTG 
TCCATGTCACTGATCTTTCTGGCAAGGAAACCATCTGCCGTGTGACTGGTGGGATGAAGG 
TAAAGGCAGACCGAGATGAATCCTCACCATATGCTGCTATGTTGGCTGCCCAGGATGTGG 
CCCAGAGGTGCAAGGAGCTGGGTATCACCGCCCTACACATCAAACTCCGGGCCACAGGAG 
GAAATAGGACCAAGACCCCTGGACCTGGGGCCCAGTCGGCCCTCAGAGCCCTTGCCCGCT 
CGGGTATGAAGATCGGGCGGATTGAGGATGTCACCCCCATCCCCTCTGACAGCACTCGCA 
GGAAGGGGGGTCGCCGTGGTCGCCGTCTGTGAACAAGATTCCTCAAAATATTTTCTGTTA 
ATAAATTGCCTTCATGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



\ 
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FIGURE 48 



MAPRKGKEKKEEQVI SLGPQVAEGENVFGVCHI FASFNDTFVHVTDLSGKETI CRVTGGM 
KVKADRDE S S P YAAMLAAQDVAQRC KELG I TALHI KLRATGGNRTKT PGPGAQS ALRALA 
RSGMKI GRI EDVTPI PSDSTRRKGGRRGRRL 

Signal sequence 
none 

transmembrane domain 
none 

N-glycosylation site. 

38-41 \ 
103-106 



N-myristoylation site. 
101-106 
111-116 
.123-128 
144-149 

Amidation site. 
144-147 
147-150 

Ribosomal protein Sll signature. 
131-138 



Ribosomal protein Sll 
29-147 
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FIGURE 49 



ACCACTGCTGGCTTTTTGCTGTAGCTCCACATTCCTGTGCATTGAGGGGTTAACATTAGG 
CTGGGAAGATGACAAAACTTGAAGAGCATCTGGAGGGAATTGTCAATATCTTCCACCAAT 
ACTCAGTTCGGAAGGGGCATTTTGACACCCTCTCTAAGGGTGAGCTGAAGCAGCTGCTTA 
CAAAGGAGCTTGCAAACACCATCAAGAATATCAAAGATAAAGCTGTCATTGATGAAATAT 
TCCAAGGCCTGGATGCTAATCAAGATGAACAGGTCGACTTTCAAGAATTCATATCCCTGG 
TAGCCATTGCGCTGAAGGCTGCCCATTACCACACCCACAAAGAGTAGGTAGCTCTCTGAA 
GGCTTTTTACCCAGCAATGTCCTCAATGAGGGTCTTTTCTTTCCCTCACCAAAACCCAGC 
CTTGCCCGTGGGGAGTAAGAGTTAATAAACACACTCACGAAAAGTT 
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FIGURE 50 



MTKLEEHLEGI VNI FHQYSVRKGHFDTLSKGELKQLLTKELANTI KNIKDKAVI DEI FQG 
LDANQDEQVDFQEFI SLVAI ALKAAHYHTHKE 

Signal sequence 
None 

Transmembrane domain 
None 

N-myristoylation site. \ 
60-65 

S-100/ICaBP type calcium binding domain 
4-47 
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FIGURE 51 



CTCTTGTCAGCTGTCTTTCAGAAGACCTGGTGGGGXAAGTCCGTGGGCATCATGTTGACC 
GAGCTGGAGAAAGCCTTGAACTCTATCATCGACGTCTACCACAAGTACTCCCTGATAAAG 
GGGAATTTCCATGCCGTCTACAGGGATGACCTGAAGAAATTGCTAGAGACCGAGTGTCCT 
CAGTATATCAGGAAAAAGGGTGCAGACGTCTGGTTCAAAGAGTTGGATATCAACACTGAT 
GGTGCAGTTAACTTCCAGGAGTTCCTCATTCTGGTGATAAAGATGGGCTGGCAGCCCACA 
AAAAAAGCCATGAAGAAAGCCACAAAGAGTAGCTSAGTTACTGCCCAGAGGCTGGGCCCC 
TGACATGTACCTGCAGAATAATAAAGTCATCAATACCTCAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



MLTELEKMjNSIIDVYHKYSLIKGNPHAVYRDDLKKLLETECPQYIRKKGADVWFKELDI 
NTDGAVNFQEFLILVIKMGWQPTKKAMKKATKSS 



signal sequence 
none 



Transmembrane domain 
None 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 
88-91 



N-myristoylation site. 
24-29 

S-100/lCaBP type calcium binding domain 
5-48 

EF hand 

50-78 
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FIGURE 53 



TTTCTTTGCGTAACCAATACTGGAAGGCATTTAAAGGACCTCTGCCGCCTCAGACCTTGC 
AGTTAACTCCGCCCTGACCCACCCTTCCCGAISCAGTCCCTGATGCAGGCTCCCCTCCTG 

atcgccctgggcttgcttctcgcgacccctgcgcaagcccacctgaaaaagccatccSg 
ctcagtagcttttcctgggataactgtgatgaagggaaggaccctg^gatcIgS? 

CTGACTCTGGAGCCTGACCCCATCGTCGTTCCTGGAAATGTGACCCTCAGTGTCGTPPrr 
AGCACCAGTGTCCCCCTGAGTTCTCCTCTGAAGGTGGATTTAGTTTTGGAGAAGGAGGTG 

gctggcctctggatcaagatcccatgcacagactacatt^ 

ttctgtgatgtgcttgacatgttaattcctactggggagccctgcccagagcccctcS? 

acctatgggcttccttgccactgtcccttcaaagaaggaacctactcacVgcccaISS 

gaattcgttgtgcctgacctggagctgcccagttggctcaccaccgotaactaccgcata 

gagagcgtcctgagcagcagtgggaagcgtctgggctgcatcaagatcgctgcctctcta 

aagggcataeagcatggcatctgccacagcagaatggagcggtgtgaggaaggtcccttt 

tcctctgttttgtgtttgccaaggccaaactcccactctctgcccccct^aatcccctt 

tctacagtgagtccactaccctcactgaaaatcattttgtaccacttacattttaggctg 

gggcaagcagccctgacctaagggagaatgagttggacagttcttgatagcccagggcat 

ctgctgggctgaccacgttactcatccccgttaacattctctcta^^ 
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FIGURE 54 



MQSLMQAPLLIALGLLLATPAQAHLiaCPSQLSSFSWDNCDEGKDPAVIRSLTLEPDPIW 
PGNVTLS WGSTSVPLS S PLKVDLVLEKEVAGLWI KI PCTDYI GSCTFEHFCDVLDMLI P 

TGEPCPEPLRTYGLPCHCPFKEGTYSLPICSEFWPDLELPSWLTTGNYRIESVLSSSGKR 
LGCI KI AASLKGI 



Signal sequence 
1-23 

* 

Transmembrane domain 
none 

N-glycosylation site. 
63-66 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 4 
26-29 

N-myristoylation site. 
14-19 

Amidation site. 
177-180 
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FIGURE 55 



^ACGGTGGGCAAGAGCAGO^GATGCTGCAGCATATTGATTACAGGATGAGGTGCATr 
CTGCAGGACGGCCGGATCTTCATTGGCACCTTCAAGGCTTTTGACAAGCACATGAATTTG 
ATCCTCTGTGACTGTGATGAGTTCAGAAAGATCAAGCCAAAGAACTCCAAACAAGCAGAA 
AGGGAAGAGAAGCGAGTCCTCGGTCTGGTGCTGCTGCGAGGGGAGAATCTGGTCTCAATG 
ACAGTAGAGGGACCTCCTCCCAAAGATACTGGTATTGCTCGAGTTCCACTTGCTGGAGCT 

GCCGGGGGCCCAGGGATCGGCAGGGCTGCTGGCAGAGGAATCCCAGCTGGGGTTCCCATG 
CCCCAGGCTCCTGCAGGACTTGCTGGGCCAGTCCGTGGGGTTGGCGGGCCATCCCAACAG 
GTGATGACCCCACAAGGAAGAGGTACTGTTGCAGCCGCTGCAGCTGCTGCCACAGCCAGT 

ATTGCCGGGGCTCCAACCCAGTACCCACCTGGCCGTGGGGGTCCTCCCCCACCTATGGGC 
CGAGGAGCACCCCCTCCAGGCATGATGGGCCCACCTCCTGGTATGAGACCTCCTATGGGT 

CCCCCAATGGGGATCCCCCCTGGAAGAGGGACTCCAATGGGCATGCCCCCTCCGGGAATG 
CGGCCTCCTCCCCCTGGGATGCGAGGCCTTCTTTGA 
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FIGURE 56 



MTVGKSSKMLQHIDYRMRCILQDGRIFIGTFKAFDICHMNLILCDCDEFRKIKPKNSKQAE 
REE KRVLGLVLLRGENLVSMTVEGPPPKDTGI ARVPLAGAAGGPG I GRAAGRGI PAGVPM 
PQAPAGLAGPVRGVGGPSQQVMTPQGRGTVAAAAAAATASIAGAPTQYPPGRGGPPPPMG 
RGAPPPGMMGPPPGMRPPMGPPMGIPPGRGTPMGMPPPGMRPPPPGMRGLL 



signal sequence 
none 



\ 

Transmembrane domain 
none 

N-myristoylation site. 



29-34 
105-110 
113-118 
148-153 
204-209 
210-215 



Sm protein 
7-82 
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FIGURE 57 



TCCTTTGGTGCCTTGTGACCAGGGCCCTGATGGTTCATTAGATGGAGCCTTCGAGTCTTA 
GGGAGTTGCCGCAGGGTCCCCACAGCGGCTCCCGACGGTTGTGAACCAGCATCCATCCTC 
CACGGATTCCGGCAACCCGCCTGGCCCTGGACGTGTCTCAACTGGCCCGCGTGAGGGGCC 
GCCCCGGAAATGACGCGCTGCCCCGCTGGCCAAGCGGAAGTGGAGATGGCGGAGCTGTAC 
GTGAAGCCGGGCAACAAGGAACGCGGCTGGAACGACCCGCCGCAGTTCTCATACGGGCTG 
CAGACCCAGGCCGGCGGACCCAGGCGCTCGCTGCTTACCAAGAGGGTAGCCGCACCCCAG 
GATGGATCCCCCAGAGTCCCCGCATCAGAGACTTCTCCTGGGCCTCCCCCAATGGGGCCT 
CCACCTCCTTCAAGTAAGGCTCCCAGGTCCCCACCTGTGGGGAGTGGTCCTGCCTCTGGC 
GTGGAGCCCACAAGTTTCCCAGTCGAGTCTGAGGCTCGACTGATGGAGGATGTGCTGAGA 
CCTTTGGAACAGGCATTGGAAGACTGCCGTGGCCACACAAGGAAGCAGGTATGTGATGAC 
ATCAGCCGACGCCTGGCACTGCTGCAGGAACAGTGGGCTGGAGGAAAGTTGTCAATACCT 
GTAAAGAAGAGAATGGCTCTACTGGTGCAAGAGCTTTCAAGCCACCGGTGGGACGCAGCA 
GATGACATCCACCGCTCCCTCATGGTTGACCATGTGACTGAGGTCAGTCAGTGGATGGTA 
GGAGTTAAAAGATTAATTGCAGAAAAGAGGAGTCTGTTTTCAGAGGAGGCAGCCAATGAA 
GAGAAATCTGCAGCCACAGCTGAGAAGAACCATACCATACCAGGCTTCCAGCAGGCTTCA 
TAATCCTCGGTTCCCCAGACT 
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FIGURE 58 



MTRCPAGQAEVEMAELYVKPGNKERGWNDPPQFSYGLQTQAGGPRRSLLTKRVAAPQDGS 
PRVPASETSPGPPPMGPPPPSSKAPRSPPVGSGPASGVEPTSFPVESEARLMEDVLRPLE 
QALEDCRGHTRKQVCDDI SRRLALLQEQWAGGKLSI PVKKRMALLVQELSSHRWDAADDI 
HRSLMVDHVTEVSQWMVGVKRL I AEKRSLFS EE AANEE KS AATAE KNHT I PGFQ QAS 

signal sequence 
none 



Transmembrane domain \ 
none 



N-glycosylation site. 
227-230 



N-myristoylation site. 
91-94 
97-100 
151-154 
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FIGURE 59 A 



GGGGGTAGCTGGCGGACCAGAGCCGGTAGCGAGGTTGGGAGAGACGGAGCGGACCTCAGC 

GCTGAAGCAGAAGTCCCCGGAGCTGCGGTCTCCCCGCCGCGGCTGAGCCATGCGGCTCCC 

TGACCTGAGACCCTGGACCTCCCTGCTGCTGGTGGACGCGGCTTTACTGTGGCTGCTTCA 

GGGCCCTCTGGGGACTTTGCTTCCTCAAGGGCTGCCAGGACTATGGCTGGAGGGGACCCT 

GCGGCTGGGAGGGCTGTGGGGGCTGCTAAAGCTAAGAGGGCTGCTGGGATTTGTGGGGAC 

ACTGCTGCTCCCGCTCTGTCTGGCCACCCCCCTGACTGTCTCCCTGAGAGCCCTGGTCGC 

GGGGGCCTCACGTGCTCCCCCAGCCAGAGTCGCTTCAGCCCCTTGGAGCTGGCTGCTGGT 

GGGGTACGGGGCTGCGGGGCTCAGCTGGTCACTGTGGGCTGTTCTGAGCCCTCCTGGAGC 

CCAGGAGAAGGAGCAGGACCAGGTGAACAACAAAGTCTTGATGTGGAGGCTGCTGAAGCT 

CTCCAGGCCGGACCTGCCTCTCCTCGTTGCCGCCTTCTTCTTCCTTGTCCTTGCTGTTTT 

GGGTGAGACATTAATCCCTCACTATTCTGGTCGTGTGATTGACATCCTGGGAGGTGATTT 

TGACCCCCATGCCTTTGCCAGTGCCATCTTCTTCATGTGCCTCTTCTCCTTTGGCAGCTC 

ACTGTCTGCAGGCTGCCGAGGAGGCTGCTTCACCTACACCATGTCTCGAATCAACTTGCG 

GATCCGGGAGCAGCTTTTCTCCTCCCTGCTGCGCCAGGACCTCGGTTTCTTCCAGGAGAC 

TAAGACAGGGGAGCTGAACTCACGGCTGAGCTCGGATACCACCCTGATGAGTAACTGGCT 

TCCTTTAAATGCCAATGTGCTCTTGCGAAGCCTGGTGAAAGTGGTGGGGCTGTATGGCTT 

CATGCTCAGCATATCGCCTCGACTCACCCTCCTTTCTCTGCTGCACATGCCCTTCACAAT 

AGCAGCGGAGAAGGTGTACAACACCCGCCATCAGGAAGTGCTTCGGGAGATCCAGGATGC 

AGTGGCCAGGGCGGGGCAGGTGGTGCGGGAAGCCGTTGGAGGGCTGCAGACCGTTCGCAG 

TTTTGGGGCCGAGGAGCATGAAGTCTGTCGCTATAAAGAGGCCCTTGAACAATGTCGGCA 

GCTGTATTGGCGGAGAGACCTGGAACGCGCCTTGTACCTGCTCGTAAGGAGGGTGCTGCA 

CTTGGGGGTGCAGATGCTGATGCTGAGCTGTGGGCTGCAGCAGATGCAGGATGGGGAGCT 

CACCCAGGGCAGCCTGCTTTCCTTTATGATCTACCAGGAGAGCGTGGGGAGCTATGTGCA 

GACCCTGGTATACATATATGGGGATATGCTCAGCAACGTGGGAGCTGCAGAGAAGGTTTT 

CTCCTACATGGACCGACAGCCAAATCTGCCTTCACCTGGCACGCTTGCCCCCACCACTCT 

GCAGGGGGTTGTGAAATTCCAAGACGTCTCCTTTGCATATCCCAATCGCCCTGACAGGCC 

TGTGCTCAAGGGGCTGACGTTTACCCTACGTCCTGGTGAGGTGACGGCGCTGGTGGGACC 

CAATGGGTCTGGGAAGAGCACAGTGGCTGCCCTGCTGCAGAATCTGTACCAGCCCACAGG 

GGGACAGGTGCTGCTGGATGAAAAGCCCATCTCACAGTATGAACACTGGTACCTGCACAG 

CCAGGTGGTTTCAGTTGGGCAGGAGCCTGTGCTGTTCTCCGGTTCTGTGAGGAACAACAT 

TGCTTATGGGCTGCAGAGCTGCGAAGATGATAAGGTGATGGCGGCTGCCCAGGCTGCCCA 

CGCAGATGACTTCATCCAGGAAATGGAGCATGGAATATACACAGATGTAGGGGAGAAGGG 

AAGCCAGCTGGCTGCGGGACAGAAACAACGTCTGGCCATTGCCCGGGCCCTTGTACGAGA 

CCCGCGGGTCCTCATCCTGGATGAGGCTACTAGTGCCCTAGATGTGCAGTGCGAGCAGGC 

CCTGCAGGACTGGAATTCCCGTGGGGATCGCACAGTGCTGGTGATTGCTCACAGGCTGCA 

GACAGTTCAGCGCGCCCACCAGATCCTGGTGCTCCAGGAGGGCAAGCTGCAGAAGCTTGC 

CCAGCTCTAGGAGGGACAGGACCTCTATTCCCGCCTGGTGCAGCAGCGGCTGATGGACTG 

AGGCCCCAGGGATACTGGGCCCTCTTCTCAGGGGCGTCTCCAGGACCCAGAGCTGTTCCT 

GCTTTGAGTTTCCCTAGAGCTGTGCGGCCAGATAGCTGTTCCTGAGTTGCAGGCACGATG 

GAGATTTGGACACTGTGTGCTTTTGGTGGGGTAGAGAGGTGGGGTGGGGTGGGGTGGGGG 

CTGTCTGTGTCCAGGAAACTTAATTCCCTGGTGACTAGAGCTTTGCCTGGTGATGAGGAG 

TATTTTGTGGCATAATACATATATTTTAAAATATTTTCCTTCTTACATGAACTGTATACA 

TTCATATAGAAAATTTAGACAATATAAAAAAGTACAAAGAAGAAAAGTAAAAGTACCCAT 

TGTTTCACTTCCTGGAGATAACCATAGTTGCTATTTTGCTGCCTGTCCCATCAGTCGTTT 

ATCTGTTGTTTGAGATAGAAATTAACCAAAAATGACATAAATATTCATGAGATTGCCTTC 

CTATATCCTTCCTTGTTCCTACCAGTGTCTGCTATTTTGAAGAAGCTAGGGTCTGGAGGG 

ACAGAGAACAGTTCCCTGATTAACAGTATTAATAGCGACATTGGTAACAGCTACCATTTA 

TAGAGTTTTAATGGGAGTAGGAGCTATGCTAAGTGTTTTTCATGTATTATCGTTTTTAAT 

CATTATCCCCAACCTATGANNlSnS^^ 

NNNNNNNN^ 

GGCCTGACCCTAGAGTCCACACCCTGACCCAATCAATTATATTGCATCTTGGGTCCATAA 
ACCCTAATCCATAATCCCATCAAGAAAAGCTCTGCTGCTCTTAGCTCTAAATAATTCAGA 
ATCTATTCTCTTCTCTCCAGTCCCGTTGTTATAGTCTTCACTCATAGACTTAAGATGATC 
CCATCACCAGAGAGGTTTCTCTACCATTAGCTTCCCTCTTCCGGCCATTCTTCACAAAGT 
CATTTTTCTAAATTCTGTGTCACATACGATGATGGCATTTCTGGAAATTCCTTCAGGTGC 
TCTCAAGCCCTGCTGCAGAGATCCTTTTCAGAGCACACACTGTTCCAGCCCATCTGTCTC 
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ACCCTCTCCTGTTGTATCCAGCTCCACGACAAACTTCTGCCTTCCCCAACACCTTTGTGC 
CTTTGCATATGGTGTTTTCTTGCCCATTTTCTGCTCGACTCGCCCCTGATTTTCAAGTTC 
AAGACTTAACTCAGGGTTCAGGTCTTCCAGGAGGCCTTACTTATGTCGTCAGTCTGGGGA 
ACTCTCCATGTGCTTCTATCACTGTGCGGTTACCTCTTTCACAGCCCTTTTAAAGTTCTA 
TCTTCCCTTTCCCACCTTTTTTGACCTTCCACTAGACCATGAGCACCTGGGCGGAAAGCC 
ATATATCTTATTAAGCTTTATATCTGCTACCTGGCCGAGGGCCTAATTCATAGTGGAGAA 
TAAATAGTCAATTGAATAAATGAATAAATATCTCCACCATCGTACTAATCTTAATCCTCC 
CTGCCCACTCCCACCACTGAAAATGCAACATTGTACACATCACTGGTTGTTGGGAGGGAC 
TTACCTTGGAAAGTTGCTATTCTAGGAAAGAGAAACCTTCATATTCCTGGAAACAGCAGG 
TAGTTTCCAGTGCTGGCAATGAATTCCCCAGAACTGCTGTTTTGGATTTTTTCTTGCCTG 
GCAGCTGTTGGGAGCAGGGTGCAGTGAGGATGGGGTGAGAGTGGGCAGTTTCTTGTGCAG 
ATTTGCCTTTCTTTCATCCTGGGGCTGACTTGCAGCTCCACACCCATCCATCTCTCAAAT 
TTCACAGAGGGTAAAATAGGCATTTGGAGAGAAAGAACTCTGGCCTGATTCCTTTCTCTC 
CCACAAATGTCCTTTATTCATAAAACAGGAATAATAATTCCTGTATCTCCCAACTACATG 
GAAGCTGCAGCCCTCACAGAAGAAGATGATCTGAGAAATTCTTTGATTTCCTCAGTACAG 
TTATACCCATGCATCATAATACTTTAAGCCTGGAAGGCATCTTAAAAATAATGCAACAGT 
CAAACCTAATTTTACAGAGAAACTGACATGAAATCACGCAGCTAATCATGATAAAGCTGG 
GTGGAAAACTTATCTTGATGGGCAGTACAGGAAGATGCAGTAGACCTTAAGATGTCCTGA 
AAGTTTCTTATCTCAGGGGAAACTCCCAGGTAGGCTTTATGTCAGGGACACAGAAAAATG 
CTCCCTGAAAGTCAAAATATTCGGGCTAGACAGACAAATTCCTGTAAGTGTGGTTTGTCT 
GGGAACCACAGATGTCACTAATCCTGGTTTGCTCCAGAGTTCTTTTTGTTCACTCCTACC 
CCCCATCACCATTTGATTGATCTCCTTACCCTGTAATTTCCCCTTCTTGTCGCTTACCTG 
CAGTATCTTTCCCACCCAGGCATGCCTTATTCTTTCTAAAGGAAAGTATGAATGGAGAGG 
GGAAAGCTTGGGAAACTGATAGATTTCCTTGGATGCCAAAACACCTCCATAGCCTGTCTG 
CCCGGCCCTATGTGGAAACAGCATTGAGTTTCAAGTCCTTTATGCCTCCACCCAGGGATA 
GCCACTTGGAATCCACATGGCAATTGTGAAACAAGCAGGAAATGCGTAATTGTCAGAATT 
TTGTGGGGAAAGGACTAGGGAATAAGGAAAACAAAGATCTTCCTTGTGTTTTAGAGCTGT 
CAGCTAGAGGAGCACCTGCTTGAGTCTGATGCCATCTAATGGTCCCAGAAGAAACTGGGT 
TTTGAACCTAGAGTTCCATGGACTCTTAGGAATTAGACTACTACTACTACTAAGCATTCA 
CTGGTGCTTACTATGTGCTATTGCTGTGCCAAGTATCTGAAACCTGTCTTCTTACCTTAT 
TTTTCAAGATAATTCTATGTGGCAGGTATTACTATCTCAATTCTAAGAGTGAGAAAATGG 
AGTTTTAGAAACATTTACTAACTTGCCTGGGTCACATAGCTAAGGAAGAGGTGGACTTGC 
CCAGCTTTGCATAAAACTCCTCAAAAGAGTTGCCTATACTCCCTGACTCCACTTATCTTC 
CTACTATCCTCTTTTTAAAATATATTATTTATTTATTTAAATAAGCAATATATGAATGTG 
GTTTGAAATTCAAAAGACACAAAGAAGTATACAGAGGAAAGCCTCACTCTCAATCCTTCT 
CAAGGTTTGCTAATTCCTCTTGCATAGGCAATCCGTTCTTCCAGCTTTGTGTTTATCTTT 
CCAGAGAAGTTTACTGTGTATTAAGCAAATATGTATATCTTTATTCTTGCTCAGTATTTT 
CGCAAACAGCAGCTGTCTAAGTTCACTGTTCTGAACTTTATTTTTTAAATTAAAAATATA 
TGGCTATGTAGTATTCTATTTTATGGAAGTTCCATATTTCATTTATCCTGTTTCCTTCTA 
CTGATGGCTAGTTAGGTTATTGGAAGTCTTTTGCTGTTGCTAGTTAGTCTTGTATAGACA 
TTGTAATGCACATGTGCAAAAATACAAGTATGATACAATCTTAAAA 
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MRLPDLRPWTSLLLVDAALLWLLQGPLGTLLPQGLPGLWLEGTLRLGGLWGLLKLRGLLG 
FVGTLLLPLCLATPLTVSLRALVAGASRAPPARVASAPWSWLLVGYGAAGLSWSLWAVLS 
PPGAQEKEQDQVNNK^MWRLLKLSRPDLPLLVAAFPFLVLAVLGETLIPHYSGRVIDIL 
GGDFDPHAFASAIFFMCLFSFGSSLSAGCRGGCFTYTMSRINLRIREQLFSSLLRQDLGF 
FQETKTGELNSRLSSDTTLMSNWLPLNANVLLRSLVKW'GLYGFMLSISPRLTLLSLLHM 
P FTI AAEKVYNTRHQ EVL RE I QDAVARAGQWREAVGGLQTVRS FGAEEHE VCR YKEALE 
QCRQLYWRRDLERALYLLVRRVLHLGVQMLMLSCGLQQMQDGELTQGSLLSFMIYQESVG 
SWQTLVYIYGDMLSMVGAAEKVFSYI4DRQPNLPSPGTLAPTTLQGVVKFQDVSFAYPNR 
PDRPVLKGLTFTLRPGEVTALVGPNGSGKSTVAALLQNLYQPTGGQVLLDEKPISQYEHC 
YLHSQ WS VGQE P VLFSGS VRNNI AYGLQS CEDDKVMAAAQAAHADDF I QEMEHGI YTDV 

GEKGSQLAAGQKQRLAIARALVRDPRVLILDEATSALDVQCEQALQDWWSRGDRTVLVIA 
HRLQTVQRAHQILVLQEGK^QKLAQL 

\ 

signal sequence 
none 

transmembrane domain 

5-25 

46-66 

62-82 . 

98-118 
146-166 
185-205 

N-glycosylation site. 

505- 508 

Tyrosine kinase phosphorylation site. 
368-376 

N-myristoylation site. 
47-52 
110-115 
202-207 
208-213 
211-216 
337-342 
407-412 
488-493 

506- 511 
558-563 
567-572 
604-609 

Cell attachment sequence. 
651-653 

ATP/GTP-binding site motif A (P-loop) . 
503 -510 

Leucine zipper pattern. 
6-27 
371-392 

ABC transporter 
496-678 
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GAATTCCGAAAGCAAGGAGMGGCCACCAAGGAGAAGCTGCAGTGTCTGAAAGATTTCCA 
CAAGGACATGGTGAAGCCCTCACCAGGGAAGAGCCCAGGCACGCGGCCTGAGGACGAGGC 
TGAGGGAAAACCTCCGCAGAGGrGAGAAGTGGTCTAGCAAGATCGACTTTGTGCTCTCTGT 
GGCTGGCGGCTTCGTGGGCTTGGGCAACGTCTGGCGCTTCCCGTACCTCTGCTACAAGAA 
TGGTGGAGGTGCGTTTCTCATACCGTATTTTATTTTCCTGTTTGGGAGCGGCCTGCCTGT 
GTTTTTCTTGGAGATCATCATAGGCCAGTACACCTCTGAAGGGGGCATCACCTGCTGGGA 
AAAGATCTGCCCCTTGTTCTCTGGTATCGGCTATGCCTCCGTTGTAATTGTGTCCCTCCT 
GAATGTCTACTACATCGTCATCCTGGCCTGGGCCACATACTACCTGTTCCAGTCCTTCCA 
GAAGGAGCTGCCCTGGGCACACTGCAACCACAGCTGGAACACACCTCACTGCATGGAGGA 
CACCATGCGCAAGAACAAGAGTGTCTGGATCACCATCAGCTCCACCAACTTCACCTCCCC 
TGTCATCGAGTTCTGGGAGCGCAACGTGCTGAGCTTGTCCCCTGGAATCGACCACCCAGG 
CTCTCTGAAATGGGACCTCGCTCTCTGCCTTCTTTTAGTCTGGCTAGTGTGTTTCTTCTG 
CATtTGGAAGGGCGTCAGGTCCACTGGGAAGGTCGTCTACTTCACAGCCACTTTTCCATT 
CGCCATGCTCCTGGTGCTGCTGGTCCGAGGGCTGACGCTGCCGGGCGCGGGCCGAGGCAT 
CAAGTTCTATCTGTATCCTGACATCACCCGCCTTGAGGACCCACAGGTGTGGATTGACGC 
TGGGACTCAGATATTCTTCTCTTATGCCATCTGCCTGGGGGCTATGACCTCGCTGGGGAG 
CTACAACAAGTACAAGTATAACTCGTACAGGGACTGTATGCTGCTGGGATGCCTGAACAG 
TGGTACCAGTTTTGTGTCTGGCTTCGCAATTTTTTCCATCCTGGGCTTCATGGCACAAGA 
GCAAGGGGTGGACATTGCTGATGTGGCTGAGTCAGGTCCTGGCCTGGCCTTCATTGCCTA 
CCCAAAAGCTGTGACAATGATGCCGCTGCCCACATTTTGGTCCATTCTTTTTTTTATTAT 
GCTTCTCTTGCTTGGACTGGATAGCCAGTTTGTTGAAGTTGAAGGACAGATCACATCCTT 
GGTTGATCTTTACCCATCCTTCCTAAGGAAGGGTTATCGTCGGGAAATCTTCATCGCCTT 
CGTGTGTAGCATCAGCTACCTGCTGGGGCTGACGATGGTGACGGAGGGTGGCATGTATGT 
GTTTCAGCTCTTTGACTACTATGCAGCTAGCGGTGTATGCCTTTTGTGGGTTGCATTCTT 
TGAATGTTTTGTTATTGCCTGGATATATGGAGGTGATAACCTTTATGATGGTATTGAGGA 
CATGATTGGCTATCGGCCCGGGCCCTGGATGAAGTACAGCTGGGTGATCACTCCAGTTCT 
CTGTGTTGGATGTTTCATCTTCTCGCTCGTCAAGTACGTACCCCTGACCTACAACAAAAC 
ATACGTGTCCCCAACTTGGGCCATTGGGCTGGGCTGGAGCCTGGCCCTTTCCTCCATGCT 
CTGCGTTCCCTTGGTCATCGTCATCCGCCTCTGCCAGACTGAGGGGCCGTTCCTTGTGAG 
AGTCAAGTACCTGCTGACCCCAAGGGAACCCAACCGCTGGGCTGTGGAGCGCGAGGGAGC 
CACACCTTACAACTCTCGCACCGTCATGAACGGCGCTCTCGTGAAACCGACCCACATCAT 
TGTGGAGACCATGATGTGAGCTCTCTCGGGTCGACGGGGCCGGCGGCTTTCCTGCTGTTT 
ACTAACATTAGATTCACATAGGACCAGGTTTACAGAGCTTTATATTTGCACTAGGATTTT 
TTTTTTTTTGTAATTGTCACAGAAAATGTAATTGTGGGTATGTGTGCGTGCGTGTGTGTG 
TGTGTGTGTGTGTATCGTGTGTGTGTGTTTTGTTTTGATTTGGGGGATATTTTGTACAAA 
AAGAAAACCCACGGGAAGATGTCCGTGGAGAGGCAGAGCTTTCATACTGAATTAGATGTA 
TTTTATGGGAATTTGGTAAATTTTTCTTTGTATTTTTTTTTTTACATATAAGTATATATA 
CACTTAGAGATTGTCATATACTTTTACCACTTGAATTGATCTTCTTGCCAGCAATAGATC 
TCATTTTCAAAAGCAATTCTTCGGTGCTGTGTAGCTGGCAGAAAGTTCTGTCCAGTAAAC 
GCAGGATGGAATTTTCCTGGGACTCTACACCCATCTTAAGGTGGTATACCTTCCAAATCC 
TGGTTCAGATGGAAGAAATAGCAGGAGAGAGGACCCATTAGCTGGCAGACCCAGGGGAAG 
AAAGGAGGGCTGTGAGGAGATACCTCATTAAACTTGGCTTAGTGAAGAAGAGAGATGCCA 
AAGGAATGAACCAACCCTTCACATAAAGGAGACTGGCTGAAGCTGAATGAGGAGGCCCTA 
TAGCAGAAGTCTGATTCTAAGAGCAGTAGAAACTTGTACCAGAAGCAAAATCCCACTTTT 
AATTTTGAGATGGTGAGTGGATAGTCAGTAGACCGTCAGAACCACTGGCCAGAGAGGGAG 
CTGCTAGAGATCCAAGAAGGCTGGCAGGAATGAGGCTCACAACTCAGCCTCGCAAGAGGT 
GGCAGAGGCACAGGAGGCCACAGTCCTTCCTGGGGCATTCCAGGCAGAGAAGGAGCAGAG 
GCTCTCCCGGCAGGAGCTGGGGTCTCAGGGCTCAGATGAGTCTGTTGCATTTGAATGGGG 
TCATAGCAGGTTCTGGTCATTCCCCAAGCAACATCTCAGCATCTCTTAAAGTTGCCTGCA 
GGAATGAAGCATGACATACCTGTTGAGGGACTAGGGGAGTGGTGGGGAGGTGAGTGGACC 
AAAGGATATAGGCCCCAGGCATGCAGATGGGCCCGGTGTCGGGGAGGGGTGCTTTCTTTC 
CTCATCTCCCCACTCCCCACTCTCAGCCTGGGAGACTCCTGCCAAGCCCTCATTAAAGAT 
GCCACCCTGGGCTGCCCTGGCACCTAGCAAGGCACACCAAGAACAGCTTTTGAGTCGTAT 
CCTCCACTGGGGAAGTGCTCCCAGTTCAGAACAAGGGCAGCCCGTGGTGCTGACCTAGGA 
TATAACAAAGCTCTTCACTTCAAAACCCCTGCAATAGCTGGGTTTACAGACATTTACCAC 
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CTGGGGACCCAAAAGAGAAGGCCTAGGAGAGTTTTCTAGAAGGTTGGGATTGTCAGGGTC 

CTGGCCCCCCAGAACTGGCTTGATCAAGGGCCTTATGTGGAGCAGAGGTTGTCTCTGAAC 

CAGGAGAGAAGGTACTATACCTTTCAAATCCCCAGGGCAGACACACCCCCACCCAGCCCC 

TATTTGGACCTAAACTGTGCCATTTGAACAGTCACTTCCAAGCTCAGTCTAAATGAAACC 

GAAACGTGACCACGCACAAAGGCAGTCACTGCTCGAGGGGTGCAGACCGCAGAATTTTCA 

CAGCAGGGGCTCTTGGAACTCTGGAAACCCCCTTCTTAAATTTGGGAGGAGGAGTATGCC 

TTTGGTGTCCCCCTCCCAAGGGCAATTCTGAACCCCATCTTTGGCAGGCATACATATTTC 

ACTGTTTCCAAAGCTATCTACTCTGCCAAACAACACCCAGTCCTATTCCAAACTCTCAAC 

GATTCTATCTTGTTCCTGTTTTTCTATGTATTTATGGTTGCCGTTTGTGTCTGATTTGAT 

TTTACTGTTTTTTCCCTGATTTTATGGAGTAGCATTGTGACCTGTTTTCCTTTGTCTTAT 

ATAACTTTAGTAAACTAACCACTGTCAATGATTGAGGGCAGGTGGCACGTGGGGAAGAGG 
GCGGAATTC 
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MATKEKLQCLICDFHKDMVKPSPGKSPGTRPEDEAEGKPPQREICWSSKIDFVLSVAGGFVG 
LGNVWRFPYLCYKNGGGAFLI PYFI FLFGSGLPVFFLEI I IGQYTSEGGITCWEKICPLF 
SGIGYASWIVSLLJSTVYYIVIIiAWATYYLFQSFQKELPWAHCl^SWNTPHCMEDTMRKNK 
SVWITISSTNFTSPVIEFWERNVLSLSPGIDHPGSLKWDLALCLLLVWLVCFFCICKGVR 
STGKWYFTATFPFAMLLVLLVRGLTLPGAGRGIKFYLYPDITRLEDPQVWIDAGTQIFF 
S YAI CLGAMTSLGS YNKYKYNS YRDCMLLGCLNSGTS FVSGFAI FS I LGFMAQEQGVDI A 
DVAESGPGLAFIAYPKAVTMMPLPTFWSILFFIMLLLLGLDSQFVEVEGQITSLVDLYPS 
FLRKGYRREIFIAFVCSISYLLGLTMVTEGGMYVFQLFDYYAASGVCLLWVAFFECFVIA 
W I YGGDNL YDG I EDM I GYRPGPWMKYS WVI TPVLCVGCFI F S LVKYVPLTYNKT YVS PTW 
AIGLGWSLALSSMLCVPLVIVIRLCQTEGPFLVRVKYLLTPREPNRWAVEREGATPYNSR 
TVMNGALVKPTHI I VETMM 

\ 

signal sequence 
none 

transmembrane domain 

47-67 

71-91 

87-107 
122-142 
216-236 
249-269 
291-311 
327-347 
382-402 
426-446 
463-483 
505-525 
544-564 

N-glycosylation site. 
163-166 
179-182 
190-193 
532-535 

Glycosaminoglycan attachment site. 

121- 124" 
365-368 

N-myristoylation site. 
56-61 
102-107 
108-113 

122- 127 
238-243 
269-274 
307-312 
330-335 
356-361 
409-414 
543-548 
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Sodium: neurotransmitter symporter family signature 1. 
65-79 

Sodium: neurotransmitter symporter family signature 2. \ 
148-168 

Sodium neurotransmitter symporter family 
41-581 
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GCTCTAGCCGGCCGTCTGGTGGCCCGAGGTGGCGGCGGGCTGGGCGCGGGGCGCGATCAG 
CGGCGCCTGCACGAGCTACGTGAGCGCAGAGCAGGAGGTGGTGCGCGGCTTCAGCTGCCC 
GCGGCCGGGGGGCGAGGCGGCCGCTGTCTTCTGCTGCGGCTTCCGCGACCACAAGTACTG 
CTGCGACGACCCGCACAGCTTCTTCCCCTACGAGCACAGCTACATGTGGTGGCTCAGCAT 
TGGCGCTCTCATAGGCCTGTCCGTAGCAGCAGTGGTTCTTCTCGCCTTCATTGTTACCGC 
CTGTGTGCTCTGCTACCTGTTCATCAGCTCTAAGCCCCACACAAAGTTGGACCTGGGCTT 
GAGCTTACAGACAGCAGGCCCTGAGGAGGTTTCTCCTGACTGCCAAGGTGTGAACACAGG 
CATGGCGGCAGAAGTGCCAAAAGTGAGCCCTCTCCAGCAGAGTTACTCCTGCTTGAACCC 
GCAGCTGGAGAGCAATGAGGGGCAGGCTGTGAACTCCAAACGCCTCCTCCATCATTGCTT 
CATGGCCACAGTGACCACCAGTGACATTCCAGGCAGCCCTGAGGAAGCCTCTGTACCCAA 
CCCTGACCTATGTGGACCAGTCCCATAAACAT 
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MSGACTSYVSAEQEWRGPSCPRPGGEAAAVFCCGPRDHKYCCDDPHSFFPYEHSYMWWL 
SIGALIGLSVAAWLLAFIVTACVLCYLFISSKPHTKLDLGLSLQTAGPEEVSPDCQGVN 
TGMAAE VPKVS PLQQS YS CLNPQLE SNEGQAVNS KRLLHHCFMATVTTSD I PGS PE EAS V 
PNPDLCGPVP 



Signal sequence 
None 



Transmembrane domain 
61-81 

N-myristoylation site. 



3-8 
25-30 
63-68 
~ 67-73 
118-123 
149-154 



